CHEMICALLY MODIFIED PROTEINS WITH A CARBOHYDRATE MOIETY
CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Patent Application
Serial No. 60/091,687, filed July 2, 1998, and U.S. Provisional Patent Application Serial
No. 60/131,446, filed April 28, 1999, both of which are hereby incorporated by reference.

FIELD OF THE INVENTION

The present invention relates to chemically modified mutant proteins
having modified glycosylation patterns with respect to a precursor protein from which
they are derived. In particular, the present invention relates to a chemically modified
mutant protein including a cysteine residue substituted for a residue other than cysteine in
a precursor protein, the substituted cysteine residue being subsequently modified by
reacting the cysteine residue with a glycosylated thiosulfonate. The present invention
also relates to a method of producing the chemically modified mutant proteins and a
glycosylated methanethiosulfonate. Another aspect of the present invention is a method
of modifying the functional characteristics of a protein by reacting the protein with a
glycosylated methanethiosulfonate reagent. The present invention also relates to methods
of determining the structure-function relationships of chemically modified mutant
proteins.






BACKGROUND OF THE INVENTION

Modifying enzyme properties by site-directed mutagenesis has been 
limited to natural amino acid replacements, although molecular biological strategies for 
overcoming this restriction have recently been derived (Cornish et al., Angew. Chem., Int.
Ed. Engl., 34:621-633 (1995)). However, the latter procedures are difficult to apply in
most laboratories. In contrast, controlled chemical modification of enzymes offers broad
potential for facile and flexible modification of enzyme structure, thereby opening up
extensive possibilities for controlled tailoring of enzyme specificity.

Changing enzyme properties by chemical modification has been explored
previously, with the first report being in 1966 by the groups of Bender (Polgar et al., J.
Am. Chem. Soc., 88:3153-3154 (1966)) and Koshland (Neet et al., Proc. Natl. Acad. Sci.
USA, 56:1606-1611 (1966)), who created a thiolsubtilisin by chemical transformation
(CH2OH → CH2SH) of the active site serine residue of subtilisin BPN' to cysteine.
Interest in chemically produced artificial enzymes, including some with synthetic
potential, was renewed by Wu (Wu et al., J. Am. Chem. Soc., 111:4514-4515 (1989); Bell
et al., Biochemistry, 32:3754-3762 (1993)) and Peterson (Peterson et al., Biochemistry,
34:6616-6620 (1995)), and, more recently, Suckling (Suckling et al., Bioorg. Med. Chem.
Lett., 3:531-534 (1993)).

Enzymes are now widely accepted as useful catalysts in organic synthesis.
However, natural, wild-type enzymes can never hope to accept all structures of synthetic
chemical interest, nor always be transformed stereospecifically into the desired
enantiomerically pure materials needed for synthesis. This potential limitation on the
synthetic applicabilities of enzymes has been recognized, and some progress has been
made in altering their specificities in a controlled manner using the site-directed and
random mutagenesis techniques of protein engineering. However, modifying enzyme
properties by protein engineering is limited to making natural amino acid replacements,
and molecular biological methods devised to overcome this restriction are not readily
amenable to routine application or large scale synthesis. The generation of new
specificities or activities obtained by chemical modification of enzymes has intrigued
chemists for many years and continues to do so.

U.S. Patent No. 5,208,158 to Bech et al. ("Bech") describes chemically
modified detergent enzymes where one or more methionines have been mutated into
cysteines. The cysteines are subsequently modified in order to confer upon the enzyme
improved stability towards oxidative agents. The claimed chemical modification is the
replacement of the thiol hydrogen with C1-6 alkyl.

Although Bech has described altering the oxidative stability of an enzyme
through mutagenesis and chemical modification, it would also be desirable to develop one
or more enzymes with altered properties such as activity, nucleophile specificity,
substrate specificity, stereoselectivity, thermal stability, pH activity profile, and surface
binding properties for use in, for example, detergents or organic synthesis. In particular,
enzymes such as subtilisins, tailored for peptide synthesis, would be desirable. Enzymes
useful for peptide synthesis have high esterase and low amidase activities. Generally,
subtilisins do not meet these requirements and the improvement of the esterase to amidase
selectivities of subtilisins would be desirable. However, previous attempts to tailor
enzymes for peptide synthesis by lowering amidase activity have generally resulted in
dramatic decreases in both esterase and amidase activities. Previous strategies for
lowering the amidase activity include the use of water-miscible organic solvents (Barbas
et al., J. Am. Chem. Soc., 110:5162-5166 (1988); Wong et al., J. Am. Chem. Soc.,
112:945-953 (1990); and Sears et al., Biotechnol. Prog., 12:423-433 (1996)) and site-
directed mutagenesis (Abrahamsen et al., Biochemistry, 30:4151-4159 (1991); Bonneau
et al., "Alteration of the Specificity of Subtilisin BPN' by Site-Directed Mutagenesis in its
S1 and S1' Binding-Sites," J. Am. Chem. Soc., 113:1026-1030 (1991); and Graycar et al.,
Ann. N.Y. Acad. Sci., 67:71-79 (1992)). However, while the ratios of esterase-to-
amidase activities were improved by these approaches, the absolute esterase activities
were lowered concomitantly. Abrahamsen et al., Biochemistry, 30:4151-4159 (1991).
Chemical modification techniques (Neet et al., Proc. Natl. Acad. Sci., 56:1606 (1966);
Polgar et al., J. Am. Chem. Soc., 88:3153-3154 (1966); Wu et al., J. Am. Chem. Soc.,
111:4514-4515 (1989); and West et al., J. Am. Chem. Soc., 112:5313-5320 (1990)),
which permit the incorporation of unnatural amino acid moieties, have also been applied
to improve esterase to amidase selectivity of subtilisins. For example, chemical
conversion of the catalytic triad serine (Ser221) of subtilisin to cysteine (Neet et al., Proc.
Natl. Acad. Sci., 56:1606 (1966); Polgar et al., J. Am. Chem. Soc., 88:3153-3154 (1966);
and Nakatsuka et al., J. Am. Chem. Soc., 109:3808-3810 (1987)) or to selenocysteine
(Wu et al., J. Am. Chem. Soc., 111:4514-4515 (1989)), and methylation of the catalytic
triad histidine (His57) of chymotrypsin (West et al., J. Am. Chem. Soc., 112:5313-5320
(1990)) effected substantial improvement in esterase-to-amidase selectivities.
Unfortunately, however, these modifications were again accompanied by 50- to 1000-fold
decreases in absolute esterase activity.
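
The trade-off described above, an improved esterase-to-amidase ratio alongside a loss of absolute esterase activity, can be illustrated with a short calculation. The following Python sketch uses hypothetical kcat/KM values chosen only for illustration; they are not taken from any of the cited studies, and the names `Activity` and `fold_change` are likewise illustrative.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Activity:
    """Specificity constants (kcat/KM) in arbitrary but consistent units."""
    esterase: float
    amidase: float

    @property
    def e_over_a(self) -> float:
        # Esterase-to-amidase selectivity ratio (E/A).
        return self.esterase / self.amidase


def fold_change(variant: float, reference: float) -> float:
    """Return variant / reference; values below 1 indicate a loss."""
    return variant / reference


# Hypothetical numbers, assumed purely for illustration.
wild_type = Activity(esterase=1000.0, amidase=100.0)
modified = Activity(esterase=20.0, amidase=0.25)

# Selectivity improves: (20 / 0.25) / (1000 / 100) = 8.
selectivity_gain = fold_change(modified.e_over_a, wild_type.e_over_a)

# Absolute esterase activity falls: 20 / 1000 = 0.02, i.e. a 50-fold loss.
esterase_retained = fold_change(modified.esterase, wild_type.esterase)
```

In this assumed case the E/A ratio rises 8-fold even though the enzyme retains only 2% of its esterase activity, which is the kind of outcome the prior approaches produced.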
Surface glycoproteins act as markers in cell-cell communication events
that determine microbial virulence (Sharon et al., Essays Biochem., 30:59-75 (1995)),
inflammation (Lasky, Annu. Rev. Biochem., 64:113-139 (1995); Weis et al., Annu. Rev.
Biochem., 65:441-473 (1996)), and host immune responses (Varki, Glycobiol., 3:97-130
(1993); Dwek, Chem. Rev., 96:683-720 (1996)). In addition, the correct glycosylation of
proteins is critical to their expression and folding (Helenius, Mol. Biol. Cell, 5:253-265
(1994)) and increases their thermal and proteolytic stability (Opdenakker et al., FASEB J.,
7:1330-1337 (1993)). Glycoproteins occur naturally in a number of forms (glycoforms)
(Rademacher et al., Annu. Rev. Biochem., 57:785-838 (1988)) that possess the same
peptide backbone, but differ in both the nature and site of glycosylation. The differences
exhibited (Rademacher et al., Annu. Rev. Biochem., 57:785-838 (1988); Parekh et al.,
Biochem., 28:7670-7679 (1989); Knight, Biotechnol., 7:35-40 (1989)) by each
component within these microheterogeneous mixtures present regulatory difficulties (Liu,
Trends Biotechnol., 10:114-120 (1992); Bill et al., Chem. Biol., 3:145-149 (1996)) and
problems in determining exact function. To explore these key properties, there is a
pressing need for methods that will not only allow the preparation of pure glycosylated
proteins, but will also allow the preparation of non-natural variants for the determination
of structure-function relationships, such as structure-activity relationships (SARs). The
few studies that have compared single glycoforms successfully have required abundant
sources and extensive chromatographic separation (Rudd et al., Biochem., 33:17-22
(1994)). Neoglycoproteins (Krantz et al., Biochem., 15:3963-3968 (1976)), formed via
unnatural linkages between sugars and proteins, provide an invaluable alternative source
of carbohydrate-protein conjugates (For reviews see Stowell et al., Adv. Carbohydr.
Chem. Biochem., 37:225-281 (1980); Neoglycoconjugates: Preparation and Applications,
Lee et al., Eds., Academic Press, London (1994); Abelson et al., Methods Enzymol., 242:
(1994); Lee et al., Methods Enzymol., 247: (1994); Bovin et al., Chem. Soc. Rev.,
24:413-421 (1995)). In particular, chemical glycosylation allows control of the glycan
structure and the nature of the sugar-protein bond. However, despite these advantages,
existing methods for their preparation (Stowell et al., Adv. Carbohydr. Chem. Biochem.,
37:225-281 (1980)) typically generate mixtures. In addition, these techniques may alter
the overall charge of the protein (Lemieux et al., J. Am. Chem. Soc., 97:4076-4083
(1975); Kobayashi et al., Methods Enzymol., 247:409-418 (1994)) or destroy the cyclic
nature of glycans introduced (Gray, Arch. Biochem. Biophys., 163:426-428 (1974)). For
example, the reductive amination of lactose with bovine serum albumin (BSA) caused
indiscriminate modification of lysine residues through the formation of acyclic amines
(Gray, Arch. Biochem. Biophys., 163:426-428 (1974)). Advances in the site-specific
glycosylation of BSA have been made (Davis et al., Tetrahedron Lett., 32:6793-6796
(1991); Wong et al., Biochem. J., 300:843-850 (1994); Macindoe et al., J. Chem.
Soc. Chem. Commun., 847-848 (1998)). However, these methods rely upon modification
of an existing cysteine in BSA and, as such, allow no flexibility in the choice of
glycosylation site. Glycoproteins occur naturally as complex mixtures of differently
glycosylated forms which are difficult to separate. To explore their properties, there is a
need for homogeneous sources of carbohydrate-protein conjugates. Existing methods
typically generate product protein mixtures of poorly characterized composition, with
little or no control over the site or level of glycosylation.

The present invention is directed to overcoming these deficiencies. 

SUMMARY OF THE INVENTION

It is an object of the present invention to provide for novel glycosylated
proteins.

It is a further object of the invention to provide for novel glycosylated 
proteins which have modified or improved functional characteristics. 

It is a further object of the invention to provide for a method of producing 
glycosylated proteins which have well defined properties, for example, by having 
predetermined glycosylation patterns. 

According to the present invention, a method is provided wherein the 
glycosylation pattern of a protein is modified in a predictable and repeatable manner.
Generally, the modification of the protein occurs via reaction of a cysteine residue in the 
protein with a glycosylated thiosulfonate. 

Thus, in one composition aspect of the present invention, a chemically 
modified mutant protein is provided, wherein said mutant protein differs from a precursor 
protein by virtue of having a cysteine residue substituted for a residue other than cysteine 
in said precursor protein, the substituted cysteine residue being subsequently modified by 
reacting said cysteine residue with a glycosylated thiosulfonate. Preferably, the
glycosylated thiosulfonate is an alkylthiosulfonate, most preferably a
methanethiosulfonate.

In a method aspect of the present invention, a method of producing a 
chemically modified mutant protein is provided comprising the steps of: (a) providing a 
precursor protein; (b) substituting an amino acid residue other than cysteine in said 
precursor protein with a cysteine; (c) reacting said substituted cysteine with a 
glycosylated thiosulfonate, said glycosylated thiosulfonate comprising a carbohydrate
moiety; and (d) obtaining a modified glycosylated protein wherein said substituted 
cysteine comprises a carbohydrate moiety attached thereto. Preferably, the glycosylated 
thiosulfonate is an alkylthiosulfonate, most preferably, a methanethiosulfonate. Also 
preferably, the substitution in said precursor protein is obtained by using recombinant 
DNA techniques by modifying a DNA encoding said precursor protein to comprise DNA 
encoding a cysteine at a desired location within the protein. 

The present invention also relates to novel glycosylated thiosulfonates. In 
a preferred embodiment, the glycosylated thiosulfonate is a methanethiosulfonate. In a 
most preferred embodiment, the glycosylated methanethiosulfonate comprises a chemical 
structure including: 

          O
          ||
      H3C-S-SR
          ||
          O

where R comprises -β-Glc, -Et-β-Gal, -Et-β-Glc, -Et-α-Glc, -Et-α-Man, -Et-Lac,
-β-Glc(Ac)2, -β-Glc(Ac)3, -β-Glc(Ac)4, -Et-α-Glc(Ac)2, -Et-α-Glc(Ac)3, -Et-α-Glc(Ac)4,
-Et-β-Glc(Ac)2, -Et-β-Glc(Ac)3, -Et-β-Glc(Ac)4, -Et-α-Man(Ac)3, -Et-α-Man(Ac)4,
-Et-β-Gal(Ac)3, -Et-β-Gal(Ac)4, -Et-Lac(Ac)5, -Et-Lac(Ac)6, or -Et-Lac(Ac)7.

Another aspect of the present invention is a method of modifying the 
functional characteristics of a protein including reacting the protein with a glycosylated 
thiosulfonate reagent under conditions effective to produce a glycoprotein with altered 
functional characteristics as compared to the protein. Accordingly, the present invention 
provides for a modified protein, wherein the protein comprises a wholly or partially
predetermined glycosylation pattern which differs from the glycosylation pattern of the
protein in its precursor, natural, or wild type state, and a method for producing such a
modified protein.

The present invention also relates to methods of determining the structure-
function relationships of chemically modified mutant proteins. One method includes
providing first and second chemically modified mutant proteins of the present invention,
wherein the glycosylation pattern of the second chemically modified mutant protein
differs from the glycosylation pattern of the first chemically modified mutant protein,
evaluating a functional characteristic of the first and second chemically modified mutant
proteins, and correlating the functional characteristic of the first and second chemically
modified mutant proteins with the structures of the first and second chemically modified
mutant proteins. Another method involves providing first and second chemically
modified mutant proteins of the present invention, wherein at least one different cysteine
residue in the second chemically modified mutant protein is modified by reacting said
cysteine residue with a glycosylated thiosulfonate, evaluating a functional characteristic
of the first and second chemically modified mutant proteins, and correlating the
functional characteristic of the first and second chemically modified mutant proteins with
the structures of the first and second chemically modified mutant proteins.

The chemically modified mutant proteins of the present invention provide
an alternative to site-directed mutagenesis and chemical modification for introducing
unnatural amino acids into proteins. Moreover, the methods of the present invention
allow the preparation of pure glycoproteins (i.e., not mixtures) with predetermined and
unique structures. These glycoproteins can then be used to determine structure-function
relationships (e.g., structure-activity relationships ("SARs")) of non-natural variants of
the proteins.

An advantage of the present invention is that it is possible to introduce 
predetermined glycosylation patterns into proteins in a simple and repeatable manner. 
This advantage provides an ability to modify critical protein characteristics such as 
partitioning, solubility, cell-cell signaling, catalytic activity, biological activity and 
pharmacological activity. Additionally, the methods of the present invention provide for 
a mechanism of "masking" certain chemically or biologically important protein sites, for
example, sites which are critical for immunological or allergenic response or sites which
are critical to proteolytic degradation of the modified protein.

Another advantage of the present invention is the ability to glycosylate a 
protein which is not generally glycosylated, or to modify the glycosylation pattern of a 
protein which is generally glycosylated. 

Another advantage of the present invention is to produce enzymes which 
have altered catalytic activity. In one specific example, the inventors herein have shown 
that it is possible to modify the substrate specificity of a protease to increase the esterase 
activity as compared to the amidase activity. Similarly, modifications of substrate 
specificity would be expected when utilizing the present invention with other enzymes. 

These and other advantages of the present invention are described in more 
detail in the following detailed description. 

BRIEF DESCRIPTION OF THE DRAWINGS



Figure 1 shows dendrimer methanethiosulfonate ("MTS") reagents.

Figure 2 shows the synthesis of a first generation glycodendrimer reagent 
which bears two D-mannose units on its termini and has one arm as a MTS which can be 
attached to a subtilisin Bacillus lentus cysteine mutant.

Figure 3 shows the synthesis of highly-functionalized glycodendrimer-protein
conjugates.

Figure 4 shows two parallel synthetic approaches to modification of
subtilisin Bacillus lentus with dendrimers. Both approaches allow the use of a large
library of methanethiosulfonate reagents (R-SSO2Me) to cap the dendrimeric branches.
The routes shown allow for the preparation of both dimeric and trimeric dendrimers.

Figure 5 shows peptide coupling catalyzed by an enzyme.

Figure 6 shows the preparation of two types of glycosylating reagents from 
D-glucose (2a): the anomeric methanethiosulfonate 1a and the ethyl-tethered
methanethiosulfonates 1b, c, g, h.

Figure 7 shows the preparation of the α-D-manno-MTS reagents 1d and 1i,
which are epimeric at C-2 relative to 1b and 1g, respectively, and the β-D-galacto-MTS
reagents 1e and 1j, epimeric at C-4 relative to 1c and 1h, respectively.

Figure 8 shows the reaction of the thiol residue of cysteine introduced into
subtilisin Bacillus lentus with glycomethanethiosulfonate reagents.

Figure 9 shows the reaction of 2,3,4,6-tetra-O-acetyl-β-D-glucopyranosyl
methanethiosulfonate (1a) with subtilisin Bacillus lentus-N62C, -S156C, -S166C, and
-L217C.

Figures 10A-D show deprotected glycan structure-proteolytic activity
SARs of subtilisin Bacillus lentus cysteine mutants and glycosylated chemically modified
mutant enzymes ("CMMs") relative to wild-type (WT). A break in the axis indicates
that the value was not determined. At position 62, glycosylation partially restores the
decrease in kcat/KM caused by mutation to cysteine (R = H) (Figure 10A). At position 217,
the 4-fold decrease in activity caused by mutation (R = H) is amplified to 6-fold lower
than WT by glycosylation with untethered S-β-Glc, but reduced to around 2.5-fold lower
than WT by glycosylation with ethyl-tethered glycans (b-f) (Figure 10B). At position
156, an arced variation in activity reaches a 3-fold lower than WT minimum kcat/KM at
bulky lacto-CMM S156C-S-f (Figure 10C). At position 166, the 2.5-fold decrease in
kcat/KM caused by mutation is amplified by glycosylation. kcat/KM decreases monotonically
from S166C (R = H) to a value that is 3.8-fold lower than WT for S166C-S-f (Figure
10D).

Figures 11A-D show the acetylated glycan structure-proteolytic activity
SARs of glycosylated chemically modified mutant enzymes relative to WT. For each
glycan the number of acetate groups present is indicated by a label on the corresponding
bar. A break in the axis indicates that the value was not determined. At positions 62
(Figure 11A), 217 (Figure 11B), and 166 (Figure 11D) an alternating trend in activity is
observed as a result of the opposite effects of acetylation upon kcat/KM according to
anomeric stereochemistry (see Figure 12). This results in a kcat/KM for N62C-S-g that is
1.1-fold higher than WT. At position 156 (Figure 11C), variations are slight and this is
consistent with its surface exposed orientation.

Figures 12A-D show the variation in proteolytic activity of glycosylated
chemically modified mutant enzymes of subtilisin Bacillus lentus upon acetylation of
glycans. Comparison of the activity of acetylated with fully deprotected chemically
modified mutant enzymes shows that at positions 62 (Figure 12A) and 217 (Figure 12B)
acetylation enhances the activity of α-tethered chemically modified mutant enzymes but
decreases that of β-tethered. In contrast, at position 166 (Figure 12D), acetylation
decreases the activity of α-tethered CMMs but increases that of β-tethered. Consistent
with its surface exposed orientation, changes at position 156 (Figure 12C) are modest.
For each glycan the number of acetate groups present is indicated by a label on the
corresponding bar. A break in the axis indicates that the value was not determined.

Figures 13A-D show esterase kcat/KM for deprotected glyco-CMMs
relative to WT. At position 62 (Figure 13A), in the S2 pocket, glycosylation leads to a
series of enzymes that have similar activities that are 1.3- to 1.9-fold greater than WT.
At position 217 (Figure 13B), in the S1' pocket, glycosylation also increases kcat/KM to a
maximum 3.5-fold greater than WT for L217C-SEtGal (-e). At position 156 (Figure
13C), in the S1 pocket, glycosylation leads to a reduction in kcat/KM. At position 166
(Figure 13D), in the S1 pocket, the dramatic loss of activity upon mutation to cysteine (R
= H) is restored by glycosylation. All five S166C deprotected glyco-CMMs have similar
kcat/KM that are 1.1- to 1.4-fold lower than WT.

Figures 14A-D show the effect of acetylation on kcat/KM of glyco-CMMs.
For each glycan the number of acetate groups present is indicated by a label on the
corresponding bar. A break in the axis indicates that the value was not determined. At
positions 62 (Figure 14A), in the S2 pocket, and 217 (Figure 14B), in the S1' pocket, the
effect of acetylation is dependent on anomeric stereochemistry. At both sites, acetylation
of α-linked sugars (-b,-d) leads to an increase in kcat/KM whereas kcat/KM is decreased for
β-linked sugars (-c,-e,-f). kcat/KM also increases as the number of acetates increases.
Consistent with the surface-exposed nature of its side chain, acetylation at position 156
(Figure 14C), in the S1 pocket, has very little effect on kcat/KM. At position 166 (Figure
14D), in the S1 pocket, the effects of acetylation are opposite to those observed at
positions 62 and 217. Acetylation increases kcat/KM of β-linked glyco-CMMs (-c,-e),
while causing a decrease for the α-linked glyco-CMMs (-b,-d).

Figures 15A-D show the E/A of deprotected glyco-CMMs relative to WT.
A break in the axis indicates that the value was not determined. Glycosylation with
deprotected reagents 1b-f increases the E/A ratio in all cases. The greatest effects are
observed at positions 62, in the S2 pocket (Figure 15A), and 217, in the S1' pocket (Figure
15B), where the largest increases in E/A result in values up to 10.9-fold greater than WT.
At both sites the E/A is higher for the β-linked glyco-CMMs than the α-linked ones. At
positions 156 (Figure 15C) and 166 (Figure 15D), in the S1 pocket, there is little variation
in E/A.

Figures 16A-D show the effect of introducing acetylated glycans on E/A.
For each glycan the number of acetate groups present is indicated by a label on the
corresponding bar. A break in the axis indicates that the value was not determined.
Values greater than zero indicate acetylation increases E/A; negative values denote a
reduction in E/A upon acetylation.

Figure 17 shows the modeling of the high esterase activity of L217C-S-
Glc(Ac)3. A minimized structure of L217C-S-Glc(Ac)3 of the active site of SBL showing
the catalytic residues Ser221, His64. The carbon atoms of the triacetylated D-glucose
moiety, which is bound to Leu217 via a disulfide bond, are numbered. The phenyl ring of
product AAPF occupies the S1 binding site and forms a crucial hydrogen bond (1.72 Å)
to Wat127. This water molecule is further stabilized by a second hydrogen bond (1.89 Å)
to the carbonyl O of the C-2 acetate group of glucose.

Figure 18 shows the proposed acyl-enzyme intermediate of L217C-S-
Glc(Ac)3. The carboxy terminus of AAPF forms a bond to the Oγ atom of Ser221.
Wat127, acting as the crucial deacylating nucleophilic water molecule, is stabilized
through its hydrogen bond to the carbonyl group of the C-2 acetate group.

DETAILED DESCRIPTION OF THE INVENTION

According to the present invention, a method is provided wherein the
glycosylation pattern of a protein is modified in a predictable and repeatable manner.
Generally, the modification of the protein occurs via reaction of a cysteine residue in the
protein with a glycosylated thiosulfonate.

Thus, in one composition aspect of the present invention, a chemically
modified mutant protein is provided, wherein said mutant protein differs from a precursor
protein by virtue of having a cysteine residue substituted for a residue other than cysteine
in said precursor protein, the substituted cysteine residue being subsequently modified by
reacting said cysteine residue with a glycosylated thiosulfonate. Preferably, the
glycosylated thiosulfonate is an alkylthiosulfonate, most preferably, a
methanethiosulfonate.

In a method aspect of the present invention, a method of producing a
chemically modified mutant protein is provided comprising the steps of: (a) providing a
precursor protein; (b) substituting an amino acid residue other than cysteine in said
precursor protein with a cysteine; (c) reacting said substituted cysteine with a
glycosylated thiosulfonate, said glycosylated thiosulfonate comprising a carbohydrate
moiety; and (d) obtaining a modified glycosylated protein wherein said substituted
cysteine comprises a carbohydrate moiety attached thereto. Preferably, the glycosylated
thiosulfonate is an alkylthiosulfonate, most preferably, a methanethiosulfonate. Also
preferably, the substitution in said precursor protein is obtained by using recombinant
DNA techniques by modifying a DNA encoding said precursor protein to comprise DNA
encoding a cysteine at a desired location within the protein. The amino acid residues to
be substituted with cysteine residues according to the present invention may be replaced
using site-directed mutagenesis methods or other methods well known in the art. See, for
example, PCT Publication No. WO 95/10615, which is hereby incorporated by reference.
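
Steps (a) through (d) above can be followed on paper with a small bookkeeping sketch. The Python below is only an in silico illustration under stated assumptions: the four-residue sequence is a toy example rather than a real precursor protein, and the function names `substitute_cysteine` and `glycosylate` are hypothetical. It models the record-keeping of the substitution and of the resulting -S-R side chain, not the chemistry itself.

```python
import re

# Labels such as "N62C": original residue, 1-based position, introduced cysteine.
MUTATION_PATTERN = re.compile(r"^([A-Z])(\d+)C$")


def substitute_cysteine(precursor: str, mutation: str) -> str:
    """Step (b): replace one non-cysteine residue of the precursor with cysteine."""
    match = MUTATION_PATTERN.match(mutation)
    if match is None:
        raise ValueError(f"not a cysteine substitution label: {mutation!r}")
    original, position = match.group(1), int(match.group(2))
    if original == "C":
        raise ValueError("the replaced residue must not already be cysteine")
    if not 1 <= position <= len(precursor):
        raise ValueError("position lies outside the precursor sequence")
    if precursor[position - 1] != original:
        raise ValueError("label does not match the precursor residue")
    return precursor[: position - 1] + "C" + precursor[position:]


def glycosylate(mutant: str, position: int, glycan: str) -> dict:
    """Steps (c) and (d): record the carbohydrate attached at the new cysteine."""
    if mutant[position - 1] != "C":
        raise ValueError("no cysteine at the requested site")
    return {"sequence": mutant, "site": position, "side_chain": f"-S-{glycan}"}


# Step (a): a toy precursor sequence, assumed for illustration only.
precursor = "ANSL"
mutant = substitute_cysteine(precursor, "N2C")
product = glycosylate(mutant, 2, "Et-β-Glc")
```

The checks in `substitute_cysteine` mirror the requirement that the replaced residue be something other than cysteine and that the chosen site exist in the precursor.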

The present invention also relates to a glycosylated thiosulfonate.
Preferably, the glycosylated thiosulfonate comprises methanethiosulfonate. More
preferably, the methanethiosulfonate comprises the chemical structure:

          O
          ||
      H3C-S-SR
          ||
          O

where R comprises -β-Glc, -Et-β-Gal, -Et-β-Glc, -Et-α-Glc, -Et-α-Man, -Et-Lac,
-β-Glc(Ac)2, -β-Glc(Ac)3, -β-Glc(Ac)4, -Et-α-Glc(Ac)2, -Et-α-Glc(Ac)3, -Et-α-Glc(Ac)4,
-Et-β-Glc(Ac)2, -Et-β-Glc(Ac)3, -Et-β-Glc(Ac)4, -Et-α-Man(Ac)3, -Et-α-Man(Ac)4,
-Et-β-Gal(Ac)3, -Et-β-Gal(Ac)4, -Et-Lac(Ac)5, -Et-Lac(Ac)6, or -Et-Lac(Ac)7.

Another aspect of the present invention is a method of modifying the
functional characteristics of a protein including providing a protein and reacting the
protein with a glycosylated thiosulfonate reagent under conditions effective to produce a
glycoprotein with altered functional characteristics as compared to the protein.
Accordingly, the present invention provides for a modified protein, wherein the protein
comprises a wholly or partially predetermined glycosylation pattern which differs from
the glycosylation pattern of the protein in its precursor, natural, or wild type state, and a
method for producing such a modified protein. As used herein, glycosylation pattern
means the composition of a carbohydrate moiety. The present invention also relates to
methods of determining the structure-function relationships of chemically modified
mutant proteins. The first method includes providing first and second chemically
modified mutant proteins of the present invention, wherein the glycosylation pattern of
the second chemically modified mutant protein differs from the glycosylation pattern of
the first chemically modified mutant protein, evaluating a functional characteristic of the
first and second chemically modified mutant proteins, and correlating the functional
characteristic of the first and second chemically modified mutant proteins with the
structures of the first and second chemically modified mutant proteins. The second
method involves providing first and second chemically modified mutant proteins of the
present invention, wherein at least one different cysteine residue in the second chemically
modified mutant protein is modified by reacting said cysteine residue with a glycosylated
thiosulfonate, evaluating a functional characteristic of the first and second chemically
modified mutant proteins, and correlating the functional characteristic of the first and
second chemically modified mutant proteins with the structures of the first and second
chemically modified mutant proteins.
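
The comparison underlying both methods above (provide two chemically modified mutant proteins, evaluate a functional characteristic of each, and relate the functional difference to the structural one) can be sketched as follows. This Python example is a minimal illustration: the class `CMM`, the function `compare`, and all numerical values are assumptions introduced here and do not correspond to measured data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CMM:
    """One chemically modified mutant: where it is glycosylated and with what."""
    site: str        # cysteine substitution label, e.g. "N62C"
    glycan: str      # carbohydrate moiety, e.g. "Et-β-Glc"
    activity: float  # evaluated functional characteristic, consistent units


def compare(first: CMM, second: CMM, reference_activity: float) -> dict:
    """Pair the structural difference between two CMMs with the functional one."""
    site_differs = first.site != second.site
    glycan_differs = first.glycan != second.glycan
    if site_differs and glycan_differs:
        varied = "site and glycan"
    elif site_differs:
        varied = "site"
    elif glycan_differs:
        varied = "glycan"
    else:
        varied = "nothing"
    return {
        "varied": varied,
        "first_vs_reference": first.activity / reference_activity,
        "second_vs_reference": second.activity / reference_activity,
        "second_vs_first": second.activity / first.activity,
    }


# Hypothetical values: same site, different glycosylation pattern (first method).
first = CMM(site="N62C", glycan="Et-α-Glc", activity=2.0)
second = CMM(site="N62C", glycan="Et-β-Glc", activity=4.0)
result = compare(first, second, reference_activity=8.0)
```

Holding the site fixed while varying the glycan corresponds to the first method; holding the glycan fixed while varying the modified cysteine corresponds to the second. Changing only one structural variable at a time keeps any difference in the evaluated characteristic attributable to that variable.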

The chemically modified mutant proteins of the present invention provide
a valuable source of carbohydrate-protein conjugates. Moreover, the methods of the
present invention allow the preparation of pure glycoproteins (i.e., not mixtures) with
predetermined and unique structures. These glycoproteins can then be used to determine
structure-function relationships (e.g., structure-activity relationships ("SARs")) of non-
natural variants of the proteins.

The protein of the invention may be any protein for which a modification
of the glycosylation pattern thereof may be desirable. For example, proteins which are
naturally not glycosylated may be glycosylated via the invention. Similarly, proteins
which exist in a naturally glycosylated form may be modified so that the glycosylation
pattern confers improved or desirable properties to the protein. Specifically, proteins
useful in the present invention are those in which glycosylation plays a role in functional
characteristics such as, for example, biological activity, chemical activity,
pharmacological activity, or immunological activity.

Glycosylated proteins as referred to herein means moieties having
carbohydrate components which are present on proteins, peptides, or amino acids. In the
present invention, the glycosylation is provided, for example, as a result of reaction of the 
glycosylated thiosulfonate with the thiol hydrogen of a cysteine residue thereby producing 
an amino acid residue which has bound thereto the carbohydrate component present on 
the glycosylated thiosulfonate. 

Another aspect of the present invention is a method of modifying the
functional characteristics of a protein including providing a protein and reacting the
protein with a glycosylated thiosulfonate reagent under conditions effective to produce a 
glycoprotein with altered functional characteristics as compared to the protein. 

The functional characteristics of a protein which may be altered by the
present invention include, but are not limited to, enzymatic activity, the effect on a human
or animal body, the ability to act as a vaccine, the tertiary structure (i.e., how the protein
folds), whether it is allergenic, its solubility, its signaling effects, its biological activity,
and its pharmacological activity (Paulson, "Glycoproteins: What are the Sugar Chains
For?", Trends in Biochem. Sciences, 14:272-276 (1989), which is hereby incorporated by
reference). The use of glycosylated thiosulfonates as thiol-specific modifying reagents in
the method of the present invention allows virtually unlimited alterations of protein
residues. In addition, this method allows the production of pure glycoproteins with
predetermined and unique structures and, therefore, unique functional characteristics, with
control over both the site and level of glycosylation. In particular, the method of
modifying the functional characteristics of a protein allows the preparation of single
glycoforms through regio- and glycan-specific protein glycosylation at predetermined
sites. Such advantages provide an array of options with respect to modification of protein
properties which did not exist in the prior art. The ability to produce proteins having very
specific and predictable glycosylation patterns will enable the production of proteins
which have known and quantifiable effects in chemical, pharmaceutical, immunological,
or catalytic performance. For example, with knowledge of a specific problematic epitope,
it would be possible to construct a modified protein according to the present invention in
which the epitope is masked by a carbohydrate moiety, thus reducing its allergenic or
immunogenic response in a subject. As another example, where the solubility of a protein
is problematic in terms of recovery or formulation in a pharmaceutical or industrial
application, it may be possible, utilizing the present invention, to produce a protein which
has an altered solubility profile, thus producing a more desirable protein product. As
another example, if a protein has the particular problem of being proteolytically unstable in
the environment in which it is to be used, then it may be possible to mask the proteolytic
cleavage sites in the protein using the present invention to cover up such a site with a
carbohydrate moiety. These examples are merely a few of the many applications of the
present invention to produce improved proteins.

In a preferred embodiment, the protein is an enzyme. The term "enzyme"
includes proteins that are capable of catalyzing chemical changes in other substances
without being changed themselves. The enzymes can be wild-type enzymes or variant
enzymes. Enzymes within the scope of the present invention include pullulanases,
proteases, cellulases, amylases, isomerases, lipases, oxidases, and reductases. Preferably,
the enzyme is a protease. The enzyme can be a wild-type or mutant protease. Wild-type
proteases can be isolated from, for example, Bacillus lentus or Bacillus amyloliquefaciens
(also referred to as BPN'). Mutant proteases can be made according to the teachings of,
for example, PCT Publication Nos. WO 95/10615 and WO 91/06637, which are hereby
incorporated by reference. Functional characteristics of enzymes which are suitable for
modification according to the present invention include, for example, enzymatic activity,
solubility, partitioning, cell-cell signaling, substrate specificity, substrate binding,
stability to temperature and reagents, ability to mask an antigenic site, physiological
functions, and pharmaceutical functions (Paulson, "Glycoproteins: What are the Sugar
Chains For?", Trends in Biochem. Sciences, 14:272-276 (1989), which is hereby
incorporated by reference).

The protein is modified so that a non-cysteine residue is substituted with a
cysteine residue, preferably by recombinant means. Preferably, the amino acids replaced
in the protein by cysteines are selected from the group consisting of asparagine, leucine,
and serine.

The terms "thiol side chain group," "thiol containing group," and "thiol
side chain" can be used interchangeably and include groups that are
used to replace the thiol hydrogen of a cysteine used to replace one of the amino acids in a
protein. Commonly, the thiol side chain group includes a sulfur through which the thiol
side chain groups defined above are attached to the thiol sulfur of the cysteine.

The glycosylated thiosulfonates of the invention are those which are 
capable of reacting with a thiol hydrogen of a cysteine to produce a glycosylated amino 
acid residue. By glycosylated is meant that the thiosulfonate has bound thereto a sugar or 
carbohydrate moiety which can be transferred to a protein pursuant to the present 
invention. Preferably, the glycosylated thiosulfonates are glycosylated 
alkylthiosulfonates, most preferably, glycosylated methanethiosulfonates. Such
glycosylated methanethiosulfonates have the general formula:

    O
    ||
H3C-S-SR
    ||
    O

In a particularly preferred embodiment, the methanethiosulfonate comprises
an R group which comprises: -β-Glc, -Et-β-Gal, -Et-β-Glc, -Et-α-Glc, -Et-α-Man,
-Et-Lac, -β-Glc(Ac)2, -β-Glc(Ac)3, -β-Glc(Ac)4, -Et-α-Glc(Ac)2, -Et-α-Glc(Ac)3,
-Et-α-Glc(Ac)4, -Et-β-Glc(Ac)2, -Et-β-Glc(Ac)3, -Et-β-Glc(Ac)4, -Et-α-Man(Ac)3,
-Et-α-Man(Ac)4, -Et-β-Gal(Ac)3, -Et-β-Gal(Ac)4, -Et-Lac(Ac)5, -Et-Lac(Ac)6, or
-Et-Lac(Ac)7.

In a preferred embodiment, the carbohydrate moiety of the present 
invention is a dendrimer moiety. Multiple functionalization of chemically modified 
mutant proteins can be achieved by dendrimer approaches, whereby multiple-branched 
linking structures can be employed to create poly-functionalized chemically modified 
mutant proteins. 

Highly branched molecules or dendrimers were first synthesized by Vogtle
in 1978 (Buhleier et al., Synthesis, 155-158 (1978), which is hereby incorporated by
reference). The attachment of identical building blocks that contain branching sites to a
central core may be achieved with a high degree of homogeneity and control. Each
branch contains a functional group which, after chemical alteration, may be connected to
yet another branching building block. In this manner, layer after layer of branching
rapidly generates highly functionalized molecules.

For instance, multiple glycosylation, including multiple mannose- 
containing chemically modified mutant proteins, and varied sugar moieties, can be 
created. The dendrimer reagent structures would include methanethiosulfonates with 
simple branching such as: 

CH3SO2S-[branched core; structure drawing not recoverable from the source text]

derived from pentaerythritol, to very complex branched dendrimer reagents (see Figure 
1). 

In particular, a first generation glycodendrimer reagent is synthesized as 
shown in Figure 2. This approach can be extended to cover larger dendrimers. More 
specifically, by leaving one "arm" of the glycodendrimer free for conversion to a 
methanethiosulfonate, the remaining arms can be further branched to synthesize
highly-functionalized glycodendrimer reagents as shown in Figure 3. Through further branching
and by using different carbohydrates, this concept can be extended to virtually unlimited 
levels. 
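
As a rough illustration of how quickly layered branching multiplies the number of
attachable carbohydrate groups, the following Python sketch counts free terminal arms per
generation. It assumes, which the text does not state, that every branching unit is a
pentaerythritol-like block with four arms, one of which is spent on the link toward the
core (or, at the core, reserved for the methanethiosulfonate group). The function name and
its parameters are hypothetical.

```python
def terminal_arms(generation: int, arms_per_unit: int = 4) -> int:
    """Count free terminal arms after `generation` layers of branching.

    Assumed model (not taken from the figures): each unit has `arms_per_unit`
    arms and uses one of them to connect toward the core, or to carry the
    methanethiosulfonate group at the core, so arms_per_unit - 1 arms remain
    available for sugars or for the next layer of units.
    """
    if generation < 1:
        raise ValueError("generation must be >= 1")
    return (arms_per_unit - 1) ** generation


for g in range(1, 4):
    print(g, terminal_arms(g))
```

Under this assumption the number of sugar attachment points triples with each added layer.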

A flexible synthetic strategy for the synthesis of core dendrimeric 
methanethiosulfonate building blocks that may be used either in situ or before 
modification to construct dendrimers is shown in Figure 4.

The present invention also relates to glycosylated thiosulfonate 
compositions. Preferably the glycosylated thiosulfonates are methanethiosulfonates and 
comprise a chemical structure: 

    O
    ||
H3C-S-SR
    ||
    O

wherein R comprises -β-Glc, -Et-β-Gal, -Et-β-Glc, -Et-α-Glc, -Et-α-Man, -Et-Lac,
-β-Glc(Ac)2, -β-Glc(Ac)3, -β-Glc(Ac)4, -Et-α-Glc(Ac)2, -Et-α-Glc(Ac)3, -Et-α-Glc(Ac)4,
-Et-β-Glc(Ac)2, -Et-β-Glc(Ac)3, -Et-β-Glc(Ac)4, -Et-α-Man(Ac)3, -Et-α-Man(Ac)4,
-Et-β-Gal(Ac)3, -Et-β-Gal(Ac)4, -Et-Lac(Ac)5, -Et-Lac(Ac)6, or -Et-Lac(Ac)7.

The present invention also relates to a method of determining the
structure-function relationships of chemically modified mutant proteins. This method involves
providing first and second chemically modified mutant proteins of the present invention,
wherein the glycosylation pattern of the second chemically modified mutant protein is
different from the glycosylation pattern of the first chemically modified mutant protein,
evaluating a functional characteristic of the first and second chemically modified mutant
proteins, and correlating the functional characteristic of the first and second chemically
modified mutant proteins with the structures of the first and second chemically modified
mutant proteins.

Evaluating a functional characteristic of the first and second chemically
modified mutant proteins includes testing for functional characteristics including, but not
limited to, stability to temperature and reagents, solubility, partitioning, enzymatic
activity, cell-cell signaling, substrate specificity, substrate binding, ability to mask an
antigenic site, physiological functions, and pharmaceutical functions (Paulson,
"Glycoproteins: What are the Sugar Chains For?", Trends in Biochem. Sciences,
14:272-276 (1989), which is hereby incorporated by reference).

Another aspect of the present invention is a second method of determining
the structure-function relationships of chemically modified mutant proteins. This method
involves providing first and second chemically modified mutant proteins of the present
invention, wherein at least one different cysteine residue in the second chemically
modified mutant protein is modified by reacting said cysteine residue with a glycosylated
thiosulfonate, evaluating a functional characteristic of the first and second chemically
modified mutant proteins, and correlating the functional characteristic of the first and
second chemically modified mutant proteins with the structures of the first and second
chemically modified mutant proteins.

By way of example to illustrate some of its advantages, the following
discussion will focus on certain proteases which are modified according to the methods of
the present invention. Alkaline serine proteases (subtilisins) are finding increasing use in
biocatalysis, particularly in chiral resolution, regioselective acylation of polyfunctional
compounds, peptide coupling, and glycopeptide synthesis. As shown in Figure 5,
subtilisins can catalyze peptide bond formation starting from an ester substrate, by first
forming an acyl enzyme intermediate which then reacts with a primary amine to form the
peptide product. This application requires high esterase activity to promote acyl enzyme
formation and low amidase activity to minimize hydrolysis of the peptide bond of the
desired product. Generally, subtilisins do not meet these requirements, and improving
the esterase-to-amidase selectivity of subtilisins has been a long-sought goal. By using
the methods provided for in the present invention, it is possible to
produce subtilisins which have advantageous properties.

The inventors in the present case used site-specific mutagenesis to modify
certain residues and introduce additional cysteine residues within subtilisin which would
then serve to react with a glycosylated methanethiosulfonate to produce a glycosylation
point at the introduced cysteine. Bacillus lentus subtilisin was selected for illustrative
purposes because it does not contain a natural cysteine and is not naturally glycosylated.

The substrate binding site of an enzyme consists of a series of subsites
across the surface of the enzyme. The portions of the substrate that correspond to the
subsites are labeled P and the subsites are labeled S. By convention, the subsites are labeled
S1, S2, S3, S4, S1', and S2'. A discussion of subsites can be found in Berger et al., Phil. Trans.
Roy. Soc. Lond. B, 257:249-264 (1970), Siezen et al., Protein Engineering, 4:719-737
(1991), and Fersht, Enzyme Structure and Mechanism, 2 ed., Freeman: New York, 29-30
(1985), which are hereby incorporated by reference.

In the present illustration, the S1, S1', or S2 subsites were selected as
suitable targets for modification. In particular, the amino acids corresponding to N62,
L217, S156, and S166 in naturally-occurring subtilisin from Bacillus amyloliquefaciens
or to equivalent amino acid residues in other subtilisins, such as Bacillus lentus subtilisin,
were selected for modification to cysteine. The mutated subtilisin was produced through
standard site-directed mutagenesis techniques and the obtained mutant subtilisin was
reacted with certain glycosylated alkylthiosulfonates, particularly glycosylated
methanethiosulfonates, as provided in the examples appended hereto.

Enzymatic peptide coupling is an attractive method for preparation of a
variety of peptides, because this method requires minimal protection of the substrate,
proceeds under mild conditions, and does not cause racemization (Wong et al., Enzymes
in Synthetic Organic Chemistry, Pergamon Press: Oxford, 41-130 (1994), which is hereby
incorporated by reference). In spite of these advantages, two major problems have limited
the use of serine proteases in peptide synthesis. One is their efficient proteolytic
(amidase) activity which causes hydrolysis of the coupling product, and the other is their
stringent structural specificity and stereospecificity.

Surprisingly, it was found that the chemically modified mutant subtilisins
of the present invention have altered esterase-to-amidase activity as compared to the
precursor enzyme. Increasing the esterase-to-amidase ratio enables the use of the enzyme
to more efficiently catalyze peptide synthesis. In particular, subtilisins can catalyze
peptide bond formation starting from an ester substrate (i.e., an acyl donor), by first
forming an acyl enzyme intermediate which then reacts with a primary amine (i.e., an acyl
acceptor) to form the peptide product, as shown in Figure 5. This reaction thus requires
high esterase activity to promote acyl enzyme formation and then low amidase activity
to minimize hydrolysis of the peptide bond of the desired product. The chemically
modified mutant subtilisins produced according to the present invention show an
increased esterase-to-amidase ratio, without reducing the absolute esterase activity of the
enzyme. In addition, certain modified enzymes of the present invention even show a
concomitant increase in the absolute esterase activity.
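
The selectivity argument above can be made concrete with a small Python sketch. It
assumes that esterase and amidase activities are each summarized by a single number in
the same units (for example, a specificity constant); this passage does not define the
measure, so that choice is an assumption. The numeric values are placeholders chosen
for illustration and are not data from this document.

```python
def esterase_to_amidase_ratio(esterase: float, amidase: float) -> float:
    """Return esterase activity divided by amidase activity (same units)."""
    if amidase <= 0:
        raise ValueError("amidase activity must be positive")
    return esterase / amidase


def fold_change(modified: float, precursor: float) -> float:
    """How many times larger a value is for the modified enzyme."""
    return modified / precursor


# Placeholder activities (hypothetical, arbitrary units).
precursor = {"esterase": 3000.0, "amidase": 200.0}
modified = {"esterase": 4500.0, "amidase": 100.0}

r_pre = esterase_to_amidase_ratio(**precursor)
r_mod = esterase_to_amidase_ratio(**modified)
print(r_pre, r_mod, fold_change(r_mod, r_pre))
```

A ratio that rises while the esterase term itself does not fall corresponds to the
behavior described for the modified subtilisins.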

Therefore, an unexpected benefit of subtilisins which are modified
according to the present invention is that they can be used in organic synthesis to, for
example, catalyze a desired reaction and/or favor a certain stereoselectivity. See, e.g.,
Noritomi et al., Biotech. Bioeng., 51:95-99 (1996); Dabulis et al., Biotech. Bioeng.,
41:566-571 (1993); and Fitzpatrick et al., J. Am. Chem. Soc., 113:3166-3171 (1991), which are
hereby incorporated by reference.

Proteins obtained using the methods provided herein may be used in any
application in which it is desired to use such proteins, where having modified functional
capabilities is advantageous. Thus, proteins modified as provided herein may be used in
the medical field for pharmaceutical compositions and in diagnostic preparations.
Additionally, proteins such as enzymes which are modified according to the present
invention may be used in applications which are generally known for such enzymes
including industrial applications such as cleaning products, textile processing, feed
modification, food modification, brewing of grain beverages, starch processing, as
antimicrobials, and in personal care formulations. Moreover, the unique functionalities
made possible by the present invention may result in uses for proteins which have not
heretofore been recognized as feasible.

EXAMPLES

Example 1 - Preparation of Methanethiosulfonate ("MTS") Reagents

The preparation of NaSSO2CH3 (Kenyon et al., Methods Enzymol.,
47:407-430 (1977), which is hereby incorporated by reference) has been described
previously (Berglund et al., J. Am. Chem. Soc., 119:5265-5266 (1997), which is hereby
incorporated by reference). Acetobromoglucose (3) (See Figure 6) (prepared from
D-glucose according to Scheurer et al., J. Am. Chem. Soc., 76:3224 (1954), which is hereby
incorporated by reference) in 73% yield, pentaacetylglucose (prepared from the
corresponding parent carbohydrates according to the method of Verley et al., Ber. Dtsch.
Chem. Ges., 34:3354-3358 (1901), which is hereby incorporated by reference, and
purified by flash chromatography) in 99% yield, 5d (See Figure 7) (prepared from the
corresponding parent carbohydrates according to the method of Verley et al., Ber. Dtsch.
Chem. Ges., 34:3354-3358 (1901), which is hereby incorporated by reference, and
purified by flash chromatography) in 92% yield, 5e (See Figure 7) (prepared from the
corresponding parent carbohydrates according to the method of Verley et al., Ber. Dtsch.
Chem. Ges., 34:3354-3358 (1901), which is hereby incorporated by reference, and
purified by flash chromatography) in 99% yield, and 5f (See Figure 7) (prepared from lactose
according to the method of Hudson et al., J. Am. Chem. Soc., 37:1270-1275 (1915),
which is hereby incorporated by reference, and purified by flash chromatography) in 82%
yield were prepared according to literature methods. N,N-dimethylformamide ("DMF")
was distilled under N2 from CaH2 and stored over molecular sieve under N2 before use.
Methanol was distilled from Mg/I2 under N2 immediately prior to use. Br(CH2)2OH was
stood over and distilled from CaO under reduced pressure and stored under N2 prior to
use. All other chemicals were used as received from Sigma-Aldrich (St. Louis, MO) or
Baker (Phillipsburg, NJ). All flash chromatography was performed using silica gel
(Whatman, 60 Å, 230-400 Mesh, Clifton, NJ). Melting points were determined using an
Electrothermal IA9000 series digital melting point apparatus and are uncorrected. IR
spectra were recorded on Bomem MB or Perkin-Elmer FTIR Spectrum 1000
spectrophotometers. 1H NMR and 13C NMR spectra were recorded on Varian Gemini
200, Unity 400, or Unity 500 NMR spectrometers at the frequencies indicated. Where
indicated, NMR peak assignments were made using Correlation Spectroscopy ("COSY")
or Distortionless Enhancement by Polarization Transfer ("DEPT") experiments; all others are
subjective. All chemical shifts were referenced to residual solvent as an internal standard;
for 13C NMR in D2O, 1,4-dioxane (67.6 ppm) was used. ES-MS data were acquired using a
PE SCIEX API III Biomolecular mass spectrometer. All HRMS data were acquired using
Micromass 70-250S or Micromass ZAB-SE mass spectrometers according to the
ionization methods indicated. Solvents were removed in vacuo.

Preparation of 2,3,4,6-Tetra-O-acetyl-β-D-glucopyranosyl methanethiosulfonate (1a).

Initial approaches to untethered glyco-MTS reagents similar in type to 1a
(See Figure 8) were based upon Danishefsky's glycal methodology (Halcomb et al., J.
Am. Chem. Soc., 111:6661-6666 (1989), which is hereby incorporated by reference).
Tris-TBS glucal was prepared according to the method of Lesimple et al., Tetrahedron
Lett., 27:6201-6204 (1986), which is hereby incorporated by reference, and oxidized to
tris-TBS protected 1,2-anhydroglucose using dimethyldioxirane. However, under a
variety of conditions and in contrast to the behavior of other sulfur nucleophiles (Gordon
et al., Carbohydr. Res., 206:361-366 (1990); Berkowitz et al., J. Am. Chem. Soc.,
114:4518-4529 (1992), which are hereby incorporated by reference),
methanethiosulfonate ion failed to open the epoxide moiety of tris-TBS protected
1,2-anhydro-D-glucose. Deprotection of 1a (See Figure 8) was attempted under a variety of
conditions, but in all cases led only to decomposition or hydrolysis of the thioglucosidic
bond (Zemplen et al., Ber. Dtsch. Chem. Ges., 56:1705-1710 (1923); Plattner et al., J.
Am. Chem. Soc., 94:8613-8615 (1972); Mori et al., Tetrahedron Lett., 20:1329-1332
(1979); Herzig et al., Carbohydr. Res., 153:162-167 (1986); Herzig et al., J. Org. Chem.,
51:727-730 (1986); Vekemans et al., Tetrahedron Lett., 28:2299-2300 (1987); Cinget et
al., Synlett, 168-170 (1993), which are hereby incorporated by reference).

Acetobromoglucose (3) (See Figure 6) (1 g, 2.43 mmol) was added to a
solution of NaSSO2CH3 (380 mg, 2.84 mmol) in ethanol (4 mL) at 90 °C under N2. After
20 minutes the resulting suspension was cooled and the solvent removed. The residue
was purified by flash chromatography (EtOAc : hexane, 9:11) and the resulting solid
recrystallized from ether to give 1a (See Figure 8) (674 mg, 63%) as a white solid; mp
151-152 °C melts then decomp. (ether); [α]D = -19.0 (c 1.24, CHCl3); IR (KBr) 1749 cm-1
(C=O), 1333, 1140 cm-1 (S-SO2); 1H NMR (400 MHz, CDCl3) δ 2.00, 2.04, 2.06, 2.07 (s
x 4, 3H x 4, Ac x 4), 3.44 (s, 3H, CH3SO2-), 3.82 (ddd, J4,5 10.1 Hz, J5,6 5.9 Hz, J5,6' 2.2
Hz, 1H, H-5), 4.08 (dd, J5,6 5.9 Hz, J6,6' 12.5 Hz, 1H, H-6), 4.31 (dd, J5,6' 2.2 Hz, J6,6' 12.5
Hz, 1H, H-6'), 5.05 (t, J 9.8 Hz, 1H, H-4), 5.07 (dd, J1,2 10.5 Hz, J2,3 9.4 Hz, 1H, H-2),
5.25 (d, J1,2 10.5 Hz, 1H, H-1), 5.29 (t, J 9.3 Hz, 1H, H-3); 13C NMR (50 MHz, CDCl3) δ
20.5, 20.7 (CH3COO- x 4), 52.8 (CH3SO2-), 61.8, 68.0, 68.7, 73.3, 76.6 (C-2, C-3, C-4,
C-5, C-6), 86.4 (C-1), 169.3, 169.3, 169.7, 170.1 (CH3COO- x 4); HRMS m/z (EI+):
Found 443.0636 (M+H+); C15H23O11S2 requires 443.0682.
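
The "requires" figures quoted with each HRMS result can be checked with a short
calculation. The Python sketch below sums monoisotopic atomic masses for an ion formula
and reports the deviation of a measured value in parts per million. The helper names are
hypothetical, and the sketch assumes the convention the quoted figures appear to follow,
namely a plain sum of neutral-atom masses with no electron-mass correction.

```python
import re

# Monoisotopic masses (u) of the most abundant isotope of each element.
MONOISOTOPIC = {
    "C": 12.0,
    "H": 1.00782503,
    "O": 15.99491462,
    "S": 31.97207117,
    "Na": 22.98976928,
    "Br": 78.9183376,
}


def monoisotopic_mass(formula: str) -> float:
    """Sum monoisotopic masses for a simple formula such as 'C15H23O11S2'."""
    total = 0.0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += MONOISOTOPIC[element] * (int(count) if count else 1)
    return total


def ppm_error(found: float, required: float) -> float:
    """Signed deviation of the measured mass from the calculated mass, in ppm."""
    return (found - required) / required * 1e6


# (M+H)+ ion of compound 1a, formula as given in the text.
required = monoisotopic_mass("C15H23O11S2")
print(round(required, 4), round(ppm_error(443.0636, required), 1))
```

The same function can be applied to the sodium adduct formulas quoted for the other
compounds, for example C16H23O10BrNa.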

Preparation of 2-(2,3,4,6-Tetra-O-acetyl-α-D-glucopyranosyl)ethyl methanethiosulfonate
(1g).

BF3·Et2O (145 μL, 1.1 mmol) was added dropwise to a suspension of
D-glucose (2a) (See Figure 6) (1.45 g, 8.1 mmol) in Br(CH2)2OH (19 mL) under N2 and the
resulting mixture heated to 105 °C. After 8 hours, the resulting solution was cooled and
the solvent removed. The residue was dissolved in Ac2O/pyridine (2:3 v/v, 16 mL) under
N2. After a further 24 hours, the reaction solvent was removed and the residue purified by
repeated flash chromatography (EtOAc then EtOAc : hexane, 3:7) to give 2-bromoethyl
2,3,4,6-tetra-O-acetyl-α-D-glucopyranoside (4g) (See Figure 6) (1.76 g, 48%) as a
colorless oil that crystallized on standing to give a white solid; mp 86-88 °C; [α]D =
+130.6 (c 0.21, CHCl3); IR (film) 1749 cm-1 (C=O); 1H NMR (400 MHz, CDCl3) δ 2.01,
2.03, 2.07, 2.09 (s x 4, 3H x 4, Ac x 4), 3.51 (t, J 5.9 Hz, 2H, -CH2Br), 3.83 (dt, Jd 11.6
Hz, Jt 5.8 Hz, 1H, -OCHH'-), 3.96 (dt, Jd 11.6 Hz, Jt 5.8 Hz, 1H, -OCHH'-), 4.10 (dd, J5,6
2.2 Hz, J6,6' 12.0 Hz, 1H, H-6), 4.14 (ddd, J4,5 10.2 Hz, J5,6 2.2 Hz, J5,6' 4.4 Hz, 1H, H-5),
4.24 (dd, J5,6' 4.4 Hz, J6,6' 12.0 Hz, 1H, H-6'), 4.84 (dd, J1,2 3.8 Hz, J2,3 10.3 Hz, 1H, H-2),
5.05 (t, J 9.1 Hz, 1H, H-4), 5.14 (d, J1,2 3.8 Hz, 1H, H-1), 5.49 (dd, J2,3 10.3 Hz, J3,4 9.5
Hz, 1H, H-3); 13C NMR (100 MHz, CDCl3) δ 20.6, 20.7 (CH3COO- x 4), 29.9 (-CH2Br),
61.9, 67.8, 68.5, 68.8, 70.0, 70.8 (-OCH2-, C-2, C-3, C-4, C-5, C-6), 96.0 (C-1), 169.6,
170.0, 170.2, 170.6 (CH3COO- x 4); HRMS m/z (FAB+): Found 477.0381 (M+Na+);
C16H23O10BrNa requires 477.0372. NaSSO2CH3 (75 mg, 0.56 mmol) was added to a
solution of 4g (See Figure 6) (190 mg, 0.42 mmol) in DMF (6 mL) under N2 and warmed
to 50 °C. After 21 hours, the solution was cooled and the solvent removed. The residue
was purified by flash chromatography (EtOAc : hexane, 1:1) to give 1g (See Figure 6)
(183 mg, 90%) as a colorless oil; [α]D27 = +92.1 (c 0.39, CHCl3); IR (film) 1748 cm-1
(C=O), 1322, 1134 cm-1 (S-SO2); 1H NMR (400 MHz, CDCl3) δ 2.01, 2.03, 2.07, 2.09 (s x
4, 3H x 4, Ac x 4), 3.41 (t, J 5.7 Hz, 2H, -CH2S-), 3.41 (s, 3H, CH3SO2-), 3.75 (dt, Jd 10.8
Hz, Jt 5.7 Hz, 1H, -OCHH'-), 3.99-4.06 (m, 2H, H-5, -OCHH'-), 4.09 (dd, J5,6 2.4 Hz, J6,6'
12.6 Hz, 1H, H-6), 4.25 (dd, J5,6' 4.6 Hz, J6,6' 12.6 Hz, 1H, H-6'), 4.87 (dd, J1,2 3.9 Hz, J2,3
10.3 Hz, 1H, H-2), 5.06 (t, J 9.8 Hz, 1H, H-4), 5.12 (d, J1,2 3.9 Hz, 1H, H-1), 5.43 (t, J 9.8
Hz, 1H, H-3); 13C NMR (100 MHz, CDCl3) δ 20.6, 20.6, 20.7, 20.7 (CH3COO- x 4), 36.0
(-CH2S-), 50.8 (CH3SO2-), 61.8 (-OCH2-), 67.0, 67.8, 68.3, 69.8, 70.7 (C-2, C-3, C-4, C-5,
C-6), 96.0 (C-1), 169.5, 170.0, 170.6 (CH3COO- x 4); HRMS m/z (FAB+): Found
487.0946 (M+H+); C17H27O12S2 requires 487.0944.
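
The millimole and percent-yield figures in these procedures follow from the weighed
masses and the molar masses of the compounds. The Python sketch below reproduces that
arithmetic for the conversion of 4g to 1g. The neutral formulas C16H23BrO10 (4g) and
C17H26O12S2 (1g) are inferred here from the ion formulas quoted in the HRMS data, the
atomic weights are rounded standard values, and the function names are hypothetical.

```python
import re

# Rounded standard atomic weights (g/mol).
ATOMIC_WEIGHT = {"C": 12.011, "H": 1.008, "O": 15.999, "S": 32.06, "Br": 79.904}


def molar_mass(formula: str) -> float:
    """Average molar mass (g/mol) of a simple formula such as 'C16H23BrO10'."""
    total = 0.0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        total += ATOMIC_WEIGHT[element] * (int(count) if count else 1)
    return total


def percent_yield(product_mg: float, product_formula: str, limiting_mmol: float) -> float:
    """Isolated yield in percent, relative to the limiting reagent."""
    product_mmol = product_mg / molar_mass(product_formula)  # mg / (g/mol) = mmol
    return 100.0 * product_mmol / limiting_mmol


# 190 mg of bromide 4g gave 183 mg of methanethiosulfonate 1g.
start_mmol = 190 / molar_mass("C16H23BrO10")
yield_pct = percent_yield(183, "C17H26O12S2", start_mmol)
print(round(start_mmol, 2), round(yield_pct))
```

The result can be compared with the 0.42 mmol and 90% stated in the procedure above.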

Preparation of 2-(α-D-Glucopyranosyl)ethyl methanethiosulfonate (1b).

A solution of NaOMe (0.1 M, 0.3 mL) was added to a suspension of 4g
(See Figure 6) (300 mg, 0.66 mmol) in MeOH (3 mL) under N2 and stirred vigorously.
After 6 hours, the resulting solution was passed through a Dowex 50W(H+) plug (2 x 1
cm, eluant MeOH) and the solvent removed to give 2-bromoethyl α-D-glucopyranoside
bromide (4b) (See Figure 6) (178 mg, 94%) (the synthesis of 4b as an intermediate has
been described previously; however, that method gave only a poor yield of product:
Nagai et al., Carbohydr. Res., 190:165-180 (1989), which is hereby incorporated by
reference) as a white solid. NaSSO2CH3 (100 mg, 0.75 mmol) was added to a solution of
4b (See Figure 6) (178 mg, 0.62 mmol) in DMF (7 mL) under N2 and warmed to 50 °C.
After 25 hours, the solution was cooled and the solvent removed. The residue was
purified by flash chromatography (MeOH : EtOAc, 1:9) to give 1b (See Figure 6) (144
mg, 73%) as a hygroscopic foam; [α]D27 = +109.9 (c 1.11, H2O); IR (film) 3423 cm-1
(OH), 1309, 1128 cm-1 (S-SO2); 1H NMR (500 MHz, D2O, COSY) δ 3.16 (t, J 9.5 Hz, 1H,
H-4), 3.28 (t, J 5.9 Hz, 2H, -CH2S-), 3.30 (s, 3H, CH3SO2-), 3.31 (dd, J1,2 3.8 Hz, J2,3 9.9
Hz, 1H, H-2), 3.44 (t, J 9.5 Hz, 1H, H-3), 3.47-3.53 (m, 2H, H-6, H-6'), 3.58-3.61 (m,
1H, H-5), 3.62 (dt, Jt 5.4 Hz, Jd 10.8 Hz, 1H, -OCHH'-), 3.79 (dt, Jt 6.3 Hz, Jd 10.8 Hz,
1H, -OCHH'-), 4.72 (d, J1,2 3.8 Hz, 1H, H-1); 13C NMR (100 MHz, D2O) δ 36.7 (-CH2S-),
50.7 (CH3SO2-), 61.5 (-OCH2-), 67.2, 70.5, 72.3, 73.2, 74.0 (C-2, C-3, C-4, C-5, C-6),
99.4 (C-1); HRMS m/z (FAB+): Found 319.0517 (M+H+); C9H19O8S2 requires 319.0521.

Preparation of 2-(2,3,4,6-Tetra-O-acetyl-β-D-glucopyranosyl)ethyl methanethiosulfonate
(1h).

BF3·Et2O (3.3 mL, 26.0 mmol) was added dropwise over the course of 15
minutes to a solution of 1,2,3,4,6-penta-O-acetyl-α,β-D-glucose (2 g, 5.1 mmol) and
Br(CH2)2OH (0.45 mL, 6.3 mmol) in CH2Cl2 (9 mL) at 0 °C under N2. After 1.5 hours,
the solution was warmed to room temperature. After 20 hours the reaction solution was
added to ice water (15 mL) and extracted with CH2Cl2 (15 mL x 3). These extracts were
combined, washed with water (15 mL), sat. NaHCO3 (aq., 15 mL), water (15 mL), dried
(MgSO4), filtered, and the solvent removed. The residue was purified by flash
chromatography (EtOAc : hexane, 1:3) to give 2-bromoethyl
2,3,4,6-tetra-O-acetyl-β-D-glucopyranoside (4h) (See Figure 6) (1.42 g, 61%) as a white solid; mp 118-120 °C
(EtOAc/iso-octane) [lit., (Coles et al., J. Am. Chem. Soc., 60:1020-1022 (1938), which is
hereby incorporated by reference) mp 117.3 °C (EtOH)]; [α]D27 = -11.9 (c 1.65, CHCl3)
[lit., (Helferich et al., Just. Lieb. Ann. Chem., 541:1-16 (1939), which is hereby
incorporated by reference) [α]D = -12.3 (c 0.2, CHCl3)]; 1H NMR (200 MHz, CDCl3) δ
2.00, 2.02, 2.07, 2.09 (s x 4, 3H x 4, Ac x 4), 3.42-3.51 (m, 2H), 3.67-3.87 (m, 2H),
4.10-4.31 (m, 3H), 4.57 (d, J1,2 8 Hz, 1H, H-1), 4.97-5.27 (m, 3H). NaSSO2CH3 (260 mg, 1.94
mmol) was added to a solution of 4h (See Figure 6) (640 mg, 1.41 mmol) in DMF (18
mL) under N2 and warmed to 50 °C. After 25 hours, the solution was cooled and the
solvent removed. The residue was purified by flash chromatography (EtOAc : hexane,
1:1) and the resulting solid recrystallized from EtOAc/hexane to give 1h (See Figure 6)
(544 mg, 80%) as a white solid; mp 115-116 °C (EtOAc/hexane); [α]D27 = +5.4 (c 1.06,
CHCl3); IR (KBr) 1758, 1741 cm-1 (C=O), 1314, 1133 cm-1 (S-SO2); 1H NMR (500 MHz,
CDCl3, COSY) δ 1.99, 2.02, 2.06, 2.08 (s x 4, 3H x 4, Ac x 4), 3.30-3.38 (m, 2H,
-CH2S-), 3.34 (s, 3H, CH3SO2-), 3.70 (ddd, J4,5 9.9 Hz, J5,6 2.2 Hz, J5,6' 4.6 Hz, 1H, H-5), 3.83
(ddd, J 5.6 Hz, J 7.4 Hz, J 10.5 Hz, 1H, -OCHH'-), 4.13-4.18 (m, 2H, H-6, -OCHH'-),
4.24 (dd, J5,6' 4.6 Hz, J6,6' 12.4 Hz, 1H, H-6'), 4.55 (d, J1,2 8.1 Hz, 1H, H-1), 4.98 (dd, J1,2
8.1 Hz, J2,3 9.7 Hz, 1H, H-2), 5.07 (t, J 9.9 Hz, 1H, H-4), 5.19 (t, J 9.6 Hz, 1H, H-3); 13C
NMR (125 MHz, CDCl3, DEPT) δ 20.5, 20.7 (q x 2, CH3COO- x 4), 36.0 (t, -CH2S-),
50.6 (q, CH3SO2-), 61.6 (t, -OCH2-), 68.1, 70.8, 71.9, 72.5 (d x 4, C-2, C-3, C-4, C-5),
68.4 (t, C-6), 100.8 (d, C-1), 169.3, 170.0, 170.5 (s x 3, CH3COO- x 4); HRMS m/z
(FAB+): Found 487.0940 (M+H+); C17H27O12S2 requires 487.0944.

Preparation of 2-(β-D-Glucopyranosyl)ethyl methanethiosulfonate (1c).

A solution of NaOMe (0.1M, 0.3 mL) was added to a suspension of 4h 

15 (See Figure 6) (300 mg, 0.66 mmol) in MeOH (3 mL) under N, and stirred vigorously. 
After 4 hours, the resulting solution was passed through a Dowex 50W(H + ) plug (2 x 1 
cm, eluant MeOH) and the solvent removed to give 2-bromoethyl |J-D-glucopyranoside 4c 
(See Figure 6) (1 76 mg, 93%) as a white solid that was used directly in the next step. A 
sample was recrystallized from EtOH/EtOAc to give a colorless, crystalline solid; mp 74- 

20 78°C (EtOH/EtOAc) [lit., (Helferich et al., lust. Lieb. Ann. Chem., 541:1-16 (1939), 
which is hereby incorporated by reference) mp 74-75°C (EtOH/EtOAc)]; [<x] 26 D = - 22.4 
(c 1.63, H 2 0) [lit., (Helferich et al., Inst. T.ieh. Ann. Chem.. 541:1-16 (1939), which is 
hereby incorporated by reference) [o]" D = - 26.1 (c 3.0, H 2 0)]; 'H NMR (400 MHz, 
CD3OD) 6 3.30 (t, J 8.4 Hz, 1H, H-2), 3.39-3.49 (m, 3H), 3.64-3.80 (m, 3H), 3.97 (br d, 

25 J** 1 1.7 Hz, 1H, H-6'), 4.02 (dt, J, 6.5 Hz, J d 1 1 .3 Hz, 1H, -OCHH'-), 4.23 (dt, J K 6.5 Hz, 
J A 1 1 .3 Hz, 1H, -OCHH1-), 4.44 (d, J l2 7.9 Hz, 1H, H-l). NaSS0 2 CH 3 (100 mg, 0.75 
mmol) was added to a solution of 4c (See Figure 6) (1 76 mg, 0.61 mmol) in DMF (7 mL) 
under N 2 and wanned to 50°C. After 15 hours, the solution was cooled and the solvent 
removed. The residue was purified by flash chromatography (MeOH : EtOAc, 1 :9) to 

30 give 1 c (See Figure 6) (1 44 mg, 74%) as a hygroscopic foam; [a] 27 D = - 1 5.8 (c 0.88, 
H 2 0); IR (KBr) 3400 cm" 1 (OH), 1310, 1131 cm'' (S-SO,); 'H NMR (500 MHz, D 2 0, 
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COSY) 8 3.07 (dd, J, 2 8.1 Hz, 9.4 Hz, IH. H-2), 3.16 (dd, J» 9.0 Hz, J u 9.8 Hz, IH, 
H-4) 3.24 (ddd, J A 5 9.8 Hz, ./, 6 6.0 Hz, A, 2.3 Hz, 1 H, H-5), 3.27 (t, J 9.0 Hz, 1 H, H-3), 
3 30-3 33 (m, 2H, -CH,S-), 3.34 (s, 3H, CH 3 SC,-), 3.50 (dd, J 5A 6.0 Hz, J,, 12.4 Hz, IH, 
H-6). 3.69 (dd, J Sb . 2.3 Hz, J 6 ., 12.4 Hz, IH, H-6'), 3.81 (dt,J ( 5.8 Hz,./, 11.5 Hz, IH, - 
OCHH'-). 4.00 (dt, J, 5.7 Hz, J d 1 1 .4 Hz, IH, -OCHH», 4.30 (d, J l3 8.1 Hz, 1 H, H-l); 
»C NMR (50 MHz, D,0) 8 36.9 (-CH 2 S-), 51.0 (CH 3 S0 2 -), 62.0 (-OOL-), 69.5, 70.9, 
74.3, 76.7, 77.3 (C-2, C-3, C-4, C-5, C-6), 103.7 (C-1); HRMS m/z (FAB+): Found 
341.0351 (M+Na + ); C 9 H, 8 0 8 S 2 Na requires 341.0341. 
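The millimole and percentage-yield figures in this preparation follow from simple molar-mass arithmetic. As an illustrative sketch only (the helper names are hypothetical, and the compositions C8H15BrO6 for 4c and C9H18O8S2 for 1c are assumptions inferred from the compound names and the HRMS formula), the conversion of 176 mg of 4c into 144 mg of 1c can be checked as follows.

```python
# Standard average atomic weights (g/mol).
ATOMIC_WEIGHT = {"C": 12.011, "H": 1.008, "O": 15.999, "S": 32.06, "Br": 79.904}


def molar_mass(composition):
    """Molar mass in g/mol from an element -> count mapping."""
    return sum(ATOMIC_WEIGHT[el] * n for el, n in composition.items())


def millimoles(mass_mg, composition):
    """Amount in mmol for a given mass in mg (mg divided by g/mol gives mmol)."""
    return mass_mg / molar_mass(composition)


def percent_yield(product_mmol, limiting_mmol):
    """Isolated yield relative to the limiting reagent."""
    return 100.0 * product_mmol / limiting_mmol


# Assumed compositions (not spelled out in the text).
BROMIDE_4C = {"C": 8, "H": 15, "Br": 1, "O": 6}   # 2-bromoethyl glucopyranoside
MTS_1C = {"C": 9, "H": 18, "O": 8, "S": 2}        # glyco-MTS reagent 1c

mmol_4c = millimoles(176, BROMIDE_4C)   # text quotes 0.61 mmol
mmol_1c = millimoles(144, MTS_1C)
yield_1c = percent_yield(mmol_1c, mmol_4c)  # text quotes 74%
print(f"4c: {mmol_4c:.2f} mmol, 1c yield: {yield_1c:.0f}%")
```

Under these assumptions the sketch agrees with the quoted 0.61 mmol and 74%.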

Preparation of 2-(2,3,4,6-Tetra-O-acetyl-α-D-mannopyranosyl)ethyl
methanethiosulfonate (1i).

BF3.Et2O (7.7 mL, 60.7 mmol) was added dropwise over the course of 15
minutes to a solution of 1,2,3,4,6-penta-O-acetyl-α,β-D-mannose (5d) (See Figure 7) (4.7
g, 12.1 mmol) and Br(CH2)2OH (1.05 mL, 14.8 mmol) in CH2Cl2 (22 mL) at 0°C under
N2. After 1 hour, the solution was warmed to room temperature. After 25 hours, the
reaction solution was added to ice water (20 mL) and extracted with CH2Cl2 (20 mL x 2).
These extracts were combined, washed with water (20 mL), sat. NaHCO3 (aq., 20 mL),
water (20 mL), dried (MgSO4), filtered, and the solvent removed. The residue was
crystallized from EtOAc/iso-octane to give 2-bromoethyl 2,3,4,6-tetra-O-acetyl-α-D-mannopyranoside
(4i) (See Figure 7) (3.52 g, 64%). Purification of the resulting mother
liquor by flash chromatography (EtOAc : hexane, 1:3) gave further 4i (See Figure 7) (320
mg, 6%; 70% in total) as a white highly crystalline solid; mp 121-123°C [lit., (Dahmen et
al., "2-Bromoethyl Glycosides - Synthesis and Characterization," Carbohydr. Res.,
116:303-307 (1983), which is hereby incorporated by reference) 118-119°C (EtOAc/iso-octane)];
[α]D = +48.3 (c 1.31, CHCl3) [lit., (Dahmen et al., "2-Bromoethyl Glycosides
- Synthesis and Characterization," Carbohydr. Res., 116:303-307 (1983), which is hereby
incorporated by reference) [α]D = +45 (c 0.6, CDCl3)]; 1H NMR (200 MHz, CDCl3) δ
1.99, 2.05, 2.10, 2.16 (s x 4, 3H x 4, Ac x 4), 3.52 (t, J 6 Hz, 2H, -CH2Br), 3.82-4.04 (m,
2H, -OCH2-), 4.09-4.16 (m, 1H, H-5), 4.13 (dd, J5,6 2 Hz, J6,6' 12 Hz, 1H, H-6), 4.28 (dd,
J5,6' [illegible] Hz, J6,6' 12 Hz, 1H, H-6'), 4.87 (br s, 1H, H-1), 5.22-5.40 (m, 3H, H-2, H-3, H-4).
NaSSO2CH3 (230 mg, 1.72 mmol) was added to a solution of 4i (See Figure 7) (600 mg,
1.32 mmol) in DMF (17 mL) under N2 and warmed to 55°C. After 20 hours, the solution
was cooled and the solvent removed. The residue was purified by flash chromatography
(EtOAc : hexane, 9:11) and the resulting solid recrystallized from Et2O/hexane to give 1i
(See Figure 7) (566 mg, 88%) as a white solid; mp 128-129°C (Et2O/hexane); [α]D =
+53.2 (c 0.92, CHCl3); IR (KBr) 1739 cm-1 (C=O), 1325, 1129 cm-1 (S-SO2); 1H NMR (500
MHz, CDCl3, COSY) δ 1.97, 2.04, 2.09, 2.14 (s x 4, 3H x 4, Ac x 4), 3.37-3.40 (m, 2H,
-CH2S-), 3.38 (s, 3H, CH3SO2-), 3.79 (dt, Jd 10.5 Hz, Jt 5.8 Hz, 1H, -OCHH'-), 3.98-4.03
(m, 2H, -OCHH'-, H-5), 4.09 (dd, J5,6 2.5 Hz, J6,6' 12.5 Hz, 1H, H-6), 4.26 (dd, J5,6' 5.6
Hz, J6,6' 12.5 Hz, 1H, H-6'), 4.85 (d, J1,2 0.7 Hz, 1H, H-1), 5.23-5.29 (m, 3H, H-2, H-3,
H-4); 13C NMR (50 MHz, CDCl3) δ 20.6, 20.7, 20.8 (CH3COO- x 4), 35.7 (-CH2S-), 50.8
(CH3SO2-), 62.5 (-OCH2-), 66.0, 66.8, 69.0, 69.2, 69.3 (C-2, C-3, C-4, C-5, C-6), 97.7
(C-1), 169.7, 169.9, 170.0, 170.6 (CH3COO- x 4); HRMS m/z (FAB+): Found 487.0954
(M+H+); C17H27O12S2 requires 487.0944.

Preparation of 2-(α-D-Mannopyranosyl)ethyl methanethiosulfonate (1d).

A solution of NaOMe (0.143 M, 0.7 mL) was added to a suspension of 4i
(See Figure 7) (1 g, 2.2 mmol) in MeOH (10 mL) under N2. After 3 hours, the resulting
solution was passed through a Dowex 50W(H+) plug (2 x 1 cm, eluant MeOH) and the
solvent removed. The residue was purified by flash chromatography (MeOH : EtOAc,
2:25) to give 2-bromoethyl α-D-mannopyranoside 4d (See Figure 7) (The use of 4d as a
reactant has been described previously, although no details of preparation or
characterization were given. (U.S. Patent 4 918 009 to Nilsson, which is hereby
incorporated by reference)) (606 mg, 96%) as a white foam; [α]26D = +50.7 (c 0.91, H2O);
IR (KBr) 3417 cm-1 (OH); 1H NMR (500 MHz, D2O, COSY) δ 3.38-3.44 (m, 3H, H-4,
-CH2Br), 3.50-3.55 (m, 2H, H-5, H-6), 3.60 (dd, J2,3 3.5 Hz, J3,4 9.7 Hz, 1H, H-3), 3.66 (dd,
J5,6' 4.6 Hz, J6,6' 11.2 Hz, 1H, H-6'), 3.68 (ddd, J 4.6 Hz, J 5.4 Hz, J 11.7 Hz, 1H,
-OCHH'-), 3.76 (dd, J1,2 1.8 Hz, J2,3 3.5 Hz, 1H, H-2), 3.81 (ddd, J 5.1 Hz, J 6.5 Hz, J
11.7 Hz, 1H, -OCHH'-), 4.71 (d, J1,2 1.8 Hz, 1H, H-1); 13C NMR (100 MHz, D2O) δ 32.1
(-CH2Br), 61.7 (-OCH2-), 67.5, 68.4, 70.7, 71.3, 73.8 (C-2, C-3, C-4, C-5, C-6), 100.5
(C-1); HRMS m/z (FAB+): Found 308.9985 (M+Na+); C8H15O6 79BrNa requires 308.9950.
NaSSO2CH3 (150 mg, 1.12 mmol) was added to a solution of 4d (See Figure 7) (245 mg,
0.85 mmol) in DMF (10 mL) under N2 and warmed to 50°C. After 16 hours, the solution
was cooled and the solvent removed. The residue was purified by flash chromatography
(MeOH : EtOAc, 1:9) to give 1d (See Figure 7) (217 mg, 80%) as a hygroscopic foam;
[α]D = +58.0 (c 1.34, H2O); IR (KBr) 3441 cm-1 (OH), 1314, 1132 cm-1 (S-SO2); 1H
NMR (500 MHz, D2O) δ 3.31 (t, J 5.8 Hz, 2H, -CH2S-), 3.35 (s, 3H, CH3SO2-), 3.45 (t, J
9.6 Hz, 1H, H-4), 3.49 (ddd, J4,5 9.8 Hz, J5,6 5.8 Hz, J5,6' 1.9 Hz, 1H, H-5), 3.55 (dd, J5,6 5.8
Hz, J6,6' 12.1 Hz, 1H, H-6), 3.60 (dd, J2,3 3.4 Hz, J3,4 9.0 Hz, 1H, H-3), 3.66 (dt, Jd 10.7 Hz,
Jt 5.7 Hz, 1H, -OCHH'-), 3.69 (dd, J5,6' 1.9 Hz, J6,6' 12.1 Hz, 1H, H-6'), 3.77 (dd, J1,2 1.6
Hz, J2,3 3.4 Hz, 1H, H-2), 3.83 (dt, Jd 11.0 Hz, Jt 5.9 Hz, 1H, -OCHH'-), 4.72 (d, J1,2 1.6
Hz, 1H, H-1); 13C NMR (125 MHz, D2O) δ 36.7 (-CH2S-), 50.7 (CH3SO2-), 61.9 (-OCH2-),
66.7, 67.7, 70.9, 71.5, 74.0 (C-2, C-3, C-4, C-5, C-6), 100.8 (C-1); HRMS m/z (FAB+):
Found 319.0528 (M+H+); C9H19O8S2 requires 319.0521.

Preparation of 2-(2,3,4,6-Tetra-O-acetyl-β-D-galactopyranosyl)ethyl
methanethiosulfonate (1j).

BF3.Et2O (8.5 mL, 67.0 mmol) was added dropwise to a solution of
1,2,3,4,6-penta-O-acetyl-α,β-D-galactose (5e) (See Figure 7) (5.1 g, 13.1 mmol) and
Br(CH2)2OH (1.15 mL, 16.2 mmol) in CH2Cl2 (24 mL) at 0°C under N2. After 1 hour, the
solution was warmed to room temperature. After 24 hours, the reaction solution was
added to ice water (20 mL) and extracted with CH2Cl2 (30 mL x 3). These extracts were
combined, washed with water (20 mL), sat. NaHCO3 (aq., 20 mL), water (20 mL), dried
(MgSO4), filtered, and the solvent removed. The residue was purified by flash
chromatography (EtOAc : hexane, 1:3) to give 2-bromoethyl 2,3,4,6-tetra-O-acetyl-β-D-galactopyranoside
(4j) (See Figure 7) (4.01 g, 67%) as a white solid; mp 116-117°C
(EtOAc/hexane) [lit., (Coles et al., J. Am. Chem. Soc., 60:1020-1022 (1938), which is
hereby incorporated by reference) 11[illegible]°C; lit., (Dahmen et al., "2-Bromoethyl Glycosides
- Synthesis and Characterization," Carbohydr. Res., 116:303-307 (1983), which is hereby
incorporated by reference) 114-116°C (EtOAc/light pet. ether)]; [α]27D = -3.8 (c 0.81,
CHCl3) [lit., (Dahmen et al., "2-Bromoethyl Glycosides - Synthesis and
Characterization," Carbohydr. Res., 116:303-307 (1983), which is hereby incorporated by
reference) [α]D = -5 (c 1.4, CDCl3)]; 1H NMR (200 MHz, CDCl3) δ 1.98, 2.05, 2.08,
2.15 (s x 4, 3H x 4, Ac x 4), 3.43-3.50 (m, 2H), 3.75-3.95 (m, 2H), 4.12-4.24 (m, 3H),
4.53 (d, J1,2 8 Hz, 1H, H-1), 5.02 (dd, J2,3 11 Hz, J3,4 3 Hz, 1H, H-3), 5.23 (dd, J1,2 8 Hz,
J2,3 11 Hz, 1H, H-2), 5.40 (br d, J3,4 3 Hz, 1H, H-4). NaSSO2CH3 (85 mg, 0.63 mmol) was
added to a solution of 4j (See Figure 7) (223 mg, 0.49 mmol) in DMF (6 mL) under N2
and warmed to 55°C. After 30 hours, the solution was cooled and the solvent removed.
The residue was purified by flash chromatography (EtOAc : hexane, 1:1) to give 1j (See
Figure 7) (198 mg, 83%) as a white foam; [α]D = +9.1 (c 1.41, CHCl3); IR (film) 1747
cm-1 (C=O), 1320, 1133 cm-1 (S-SO2); 1H NMR (500 MHz, CDCl3) δ 1.98, 2.05, 2.09,
2.15 (s x 4, 3H x 4, Ac x 4), 3.35 (s, 3H, CH3SO2-), 3.35-3.38 (m, 2H, -CH2S-), 3.84
(ddd, J 6.1 Hz, J 7.1 Hz, J 10.5 Hz, 1H, -OCHH'-), 3.92 (td, J4,5 1.1 Hz, Jt 6.6 Hz, 1H,
H-5), 4.10-4.21 (m, 3H, H-6, H-6', -OCHH'-), 4.52 (d, J1,2 8.0 Hz, 1H, H-1), 5.01 (dd, J2,3
10.3 Hz, J3,4 3.5 Hz, 1H, H-3), 5.20 (dd, J1,2 8.0 Hz, J2,3 10.3 Hz, 1H, H-2), 5.40 (dd, J3,4
3.5 Hz, J4,5 1.1 Hz, 1H, H-4); 13C NMR (100 MHz, CDCl3) δ 20.6, 20.7, 20.8 (CH3COO-
x 4), 36.1 (-CH2S-), 50.6 (CH3SO2-), 61.2 (-OCH2-), 67.0, 68.3, 68.5, 70.8, 71.0 (C-2,
C-3, C-4, C-5, C-6), 101.3 (C-1), 169.5, 170.0, 170.1, 170.4 (CH3COO- x 4); HRMS m/z
(FAB+): Found 487.0936 (M+H+); C17H27O12S2 requires 487.0944.

Preparation of 2-(β-D-Galactopyranosyl)ethyl methanethiosulfonate (1e).

A solution of NaOMe (0.104 M, 0.8 mL) was added to a solution of 4j
(See Figure 7) (778 mg, 1.71 mmol) in MeOH (10 mL) under N2. After 4 hours, the
reaction solution was passed through a Dowex 50W(H+) plug (3 x 1 cm, eluant MeOH)
and the solvent removed to give 2-bromoethyl β-D-galactopyranoside (4e) (See Figure 7)
(450 mg, 92%) (The synthesis of unstable 4e has been described previously. Dahmen et
al., "2-Bromoethyl Glycosides. 4. 2-Bromoethyl Glycosides in Glycoside Synthesis -
Preparation of Glycoproteins Containing Alpha-L-Fuc-(1->2)-D-Gal and Beta-D-Gal-(1->4)-D-Glcnac,"
Carbohydr. Res., 125:237-245 (1984), which is hereby incorporated by
reference) as a white solid which was used directly in the next step. NaSSO2CH3 (180
mg, 1.34 mmol) was added to a solution of 4e (See Figure 7) (290 mg, 1.01 mmol) in DMF
(12 mL) under N2 and warmed to 50°C. After 15 hours, the solution was cooled and the
solvent removed. The residue was purified by flash chromatography (MeOH : EtOAc,
1:9) to give 1e (See Figure 7) (229 mg, 71%) as a white foam; [α]27D = +2.9 (c 0.58,
H2O); IR (film) 3358 cm-1 (br, O-H), 1306, 1120 cm-1 (S-SO2); 1H NMR (500 MHz, D2O,
COSY) δ 3.29-3.33 (m, 2H, -CH2S-), 3.30 (dd, J1,2 7.7 Hz, J2,3 10.0 Hz, 1H, H-2), 3.35 (s,
3H, CH3SO2-), 3.43 (dd, J2,3 10.0 Hz, J3,4 3.6 Hz, 1H, H-3), 3.48 (ddd, J4,5 0.9 Hz, J5,6 4.3
Hz, J5,6' 7.9 Hz, 1H, H-5), 3.52 (dd, J5,6 4.3 Hz, J6,6' 11.7 Hz, 1H, H-6), 3.57 (dd, J5,6' 7.9
Hz, J6,6' 11.7 Hz, 1H, H-6'), 3.70 (dd, J3,4 3.6 Hz, J4,5 0.9 Hz, 1H, H-4), 3.80 (dt, Jd 11.2
Hz, Jt 6.1 Hz, 1H, -OCHH'-), 4.01 (dt, Jd 11.4 Hz, Jt 5.8 Hz, 1H, -OCHH'-), 4.24 (d, J1,2
7.7 Hz, 1H, H-1); 13C NMR (100 MHz, D2O) δ 36.7 (-CH2S-), 50.8 (CH3SO2-), 61.9
(-OCH2-), 69.2, 69.6, 71.7, 73.7, 76.2 (C-2, C-3, C-4, C-5, C-6), 104.0 (C-1); HRMS m/z
(FAB+): Found 319.0523 (M+H+); C9H19O8S2 requires 319.0521.

Preparation of 2-(2,3,6-Tri-O-acetyl-4-O-(2,3,4,6-tetra-O-acetyl-β-D-galactopyranosyl)-
β-D-glucopyranosyl)ethyl methanethiosulfonate (1k).

BF3.Et2O (4.0 mL, 31.5 mmol) was added dropwise to a solution of
1,2,3,6-tetra-O-acetyl-4-O-(2,3,4,6-tetra-O-acetyl-β-D-galactopyranosyl)-β-D-glucopyranoside
(5f) (See Figure 7) (5 g, 7.4 mmol) and Br(CH2)2OH (0.65 mL, 9.2
mmol) in CH2Cl2 (15 mL) at 0°C under N2. After 1 hour, the solution was warmed to
room temperature. After 20 hours, the reaction solution was added to ice water (15 mL)
and extracted with CH2Cl2 (20 mL x 2). These extracts were combined, washed with
water (20 mL), sat. NaHCO3 (aq., 20 mL), water (20 mL), dried (MgSO4), filtered, and the
solvent removed. The residue was purified by flash chromatography (EtOAc : hexane,
1:1) to give 2-bromoethyl 2,3,6-tri-O-acetyl-4-O-(2,3,4,6-tetra-O-acetyl-β-D-galactopyranosyl)-β-D-glucopyranoside
(4k) (See Figure 7) (2.94 g, 53%) as a white
foam; [α]D = -7.8 (c 1.28, CHCl3) [lit., (Dahmen et al., "2-Bromoethyl Glycosides -
Synthesis and Characterization," Carbohydr. Res., 116:303-307 (1983), which is hereby
incorporated by reference) [α]D = -11 (c 1.[illegible], CHCl3)]; 1H NMR (500 MHz, CDCl3,
COSY) δ 1.94, 2.02, 2.02 (s x 3, 3H x 3, Ac x 3), 2.04 (s, 6H, Ac x 2), 2.10, 2.13 (s x 2,
3H x 2, Ac x 2), 3.38-3.46 (m, 2H, -CH2Br), 3.59 (ddd, J 2.2 Hz, J 4.9 Hz, J 9.9 Hz,
1H, H-5'), 3.75-3.80 (m, 2H, H-4', -OCHH'-), 3.85 (td, J4,5 1.1 Hz, Jt 6.9 Hz, 1H, H-5),
4.03-4.12 (m, 4H, H-6, H-6', H-6'', -OCHH'-), 4.45 (d, J1,2 7.8 Hz, 1H, H-1), 4.48 (dd,
J5',6''' 2.2 Hz, J6'',6''' 12.1 Hz, 1H, H-6'''), 4.50 (d, J1',2' 7.9 Hz, 1H, H-1'), 4.89 (dd, J1',2' 7.9
Hz, J2',3' 9.6 Hz, 1H, H-2'), 4.92 (dd, J2,3 10.5 Hz, J3,4 3.4 Hz, 1H, H-3), 5.08 (dd, J1,2 7.8
Hz, J2,3 10.5 Hz, 1H, H-2), 5.18 (t, J 9.6 Hz, 1H, H-3'), 5.32 (dd, J3,4 3.4 Hz, J4,5 1.1 Hz,
1H, H-4). NaSSO2CH3 (87 mg, 0.65 mmol) was added to a solution of 4k (See Figure 7)
(357 mg, 0.48 mmol) in DMF (6 mL) under N2 and warmed to 50°C. After 22 hours, the
solution was cooled and the solvent removed. The residue was purified by flash
chromatography (EtOAc : hexane, 11:9) to give 1k (See Figure 7) (327 mg, 88%) as a
white foam; [α]27D = -3.7 (c 1.0, CHCl3); IR (KBr) 1751 cm-1 (C=O), 1323, 1134 cm-1
(S-SO2); 1H NMR (500 MHz, CDCl3, COSY) δ 1.94, 2.02, 2.02, 2.04, 2.04, 2.11, 2.13 (s x 7,
3H x 7, Ac x 7), 3.29-3.40 (m, 2H, -CH2S-), 3.32 (s, 3H, CH3SO2-), 3.59 (ddd, J4',5' 9.9
Hz, J5',6'' 4.9 Hz, J5',6''' 2.2 Hz, 1H, H-5'), 3.77 (t, J 9.5 Hz, 1H, H-4'), 3.79-3.86 (m, 2H,
H-5, -OCHH'-), 4.03-4.13 (m, 4H, H-6, H-6', H-6'', -OCHH'-), 4.46 (d, J1,2 7.8 Hz, 1H,
H-1), 4.50 (d, J1',2' 8.0 Hz, 1H, H-1'), 4.52 (dd, J5',6''' 2.2 Hz, J6'',6''' 11.9 Hz, 1H, H-6'''),
4.87 (dd, J1',2' 8.0 Hz, J2',3' 9.6 Hz, 1H, H-2'), 4.93 (dd, J2,3 10.5 Hz, J3,4 3.5 Hz, 1H, H-3),
5.08 (dd, J1,2 7.8 Hz, J2,3 10.5 Hz, 1H, H-2), 5.17 (t, J 9.3 Hz, 1H, H-3'), 5.32 (dd, J3,4 3.5
Hz, J4,5 1.0 Hz, 1H, H-4); 13C NMR (125 MHz, CDCl3, DEPT) δ 20.5, 20.7, 20.8, 20.9 (q
x 4, CH3COO- x 7), 36.0 (t, -CH2S-), 50.6 (q, CH3SO2-), 60.7, 61.6, 68.5 (t x 3, -OCH2-,
C-6, C-6'), 66.5, 69.0, 70.6, 70.9, 71.3, 72.6, 72.8, 76.0 (d x 8, C-2, C-3, C-4, C-5, C-2',
C-3', C-4', C-5'), 100.7, 101.1 (d x 2, C-1, C-1'), 169.1, 169.7, 169.7, 170.1, 170.2,
170.3, 170.4 (s x 7, CH3COO- x 7); HRMS m/z (FAB+): Found 775.1793 (M+H+);
C29H43O20S2 requires 775.1789.

Preparation of 2-(4-O-β-D-Galactopyranosyl-β-D-glucopyranosyl)ethyl
methanethiosulfonate (1f).

A solution of NaOMe (0.1 M, 0.6 mL) was added to a solution of 4k (See
Figure 7) (877 mg, 1.18 mmol) in MeOH (6 mL) under N2. After 3 hours, the reaction
solution was passed through a Dowex 50W(H+) plug (4 x 1 cm, eluant MeOH) and the
solvent removed to give 2-bromoethyl 4-O-(β-D-galactopyranosyl)-β-D-glucopyranoside
(4f) (See Figure 7) (476 mg, 90%) as a white foam which was used directly in the next
step. NaSSO2CH3 (185 mg, 1.38 mmol) was added to a solution of 4f (See Figure 7) (476
mg, 1.06 mmol) in DMF (24 mL) under N2 and warmed to 50°C. After 21 hours, the
solution was cooled and the solvent removed. The residue was purified by flash
chromatography (CHCl3 : MeOH : AcOH : H2O, 60:30:3:5) to give 1f (See Figure 7) (346
mg, 68%) as a hygroscopic foam; [α]28D = +1.5 (c 1.66, H2O); IR (KBr) 3416 cm-1 (br,
OH), 1311, 1131 cm-1 (S-SO2); 1H NMR (400 MHz, D2O, COSY) δ 3.10-3.13 (m, 1H,
H-2), 3.30 (t, J 6.0 Hz, 2H, -CH2S-), 3.31 (dd, J1',2' 7.8 Hz, J2',3' 10.3 Hz, 1H, H-2'), 3.34 (s,
3H, CH3SO2-), 3.38-3.54 (m, 5H), 3.44 (dd, J2',3' 10.3 Hz, J3',4' 3.3 Hz, 1H, H-3'), 3.57
(dd, J 8.4 Hz, J 11.4 Hz, 1H), 3.59 (dd, J 4.9 Hz, J 7.3 Hz, 1H), 3.70 (br d, J3',4' 3.3 Hz,
1H, H-4'), 3.77 (dd, J 1.0 Hz, J 11.5 Hz, 1H), 3.79-3.83 (m, 1H, -OCHH'-), 3.97-4.02 (m,
1H, -OCHH'-), 4.22 (d, J1',2' 7.8 Hz, 1H, H-1'), 4.33 (d, J1,2 7.8 Hz, 1H, H-1); 13C NMR
(125 MHz, D2O) δ 36.7 (-CH2S-), 50.8 (CH3SO2-), 61.0, 62.1, 69.4, 69.6, 71.9, 73.5, 73.7,
75.3, 75.9, 76.4, 79.3 (-OCH2-, C-2, C-3, C-4, C-5, C-6, C-2', C-3', C-4', C-5', C-6'),
103.3, 103.9 (C-1, C-1'); HRMS m/z (FAB+): Found 503.0886 (M+Na+); C15H28O13S2Na
requires 503.0869.

Example 2 - General Procedure for Modification of Subtilisin Bacillus lentus
("SBL") Mutants Stored as Flash-Frozen Solutions

A 1.25 mL frozen aliquot of the mutant enzyme (SBL-N62C, -L217C, or
-S166C) containing approximately 25 mg of enzyme was thawed and added to 1.25 mL of
Modifying Buffer (see below) in a polypropylene test-tube. To this solution was added
100 µL of a 0.2 M glyco-MTS reagent solution (1a,g-k in MeCN, 1b-f in water (See
Figure 8)). The mixture was sealed, vortexed, and placed on an end-over-end rotator at
room temperature. When the modification was complete (determined by a specific
activity assay, using succinyl-AlaAlaProPhe-p-nitroanilide [ε410 = 8800 M-1 cm-1]
(Bonneau et al., "Alteration of the Specificity of Subtilisin BPN' by Site-Directed
Mutagenesis in its S1 and S1' Binding-Sites," J. Am. Chem. Soc., 119:1026-1030 (1991),
which is hereby incorporated by reference) as substrate in 0.1 M Tris-HCl buffer
containing 0.005% Tween 80, 1% DMSO, pH 8.6 showing constant activity and titration
with Ellman's reagent (ε412 = 13600 M-1 cm-1) (Ellman et al., Biochem. Pharmacol., 7:88-
95 (1961), which is hereby incorporated by reference) showing no free thiol present in
solution), a further 50 µL of the modifying reagent solution was added and the mixture
placed back on the end-over-end rotator for a further 10 minutes. The reaction was
poured onto a pre-packed, pre-equilibrated G-25 Sephadex® PD10 column and eluted with
3.5 mL Quench Buffer (see below). The eluant was dialysed at 4°C against 10 mM MES,
1 mM CaCl2, pH 5.8 (2 x 1 L, 2 x 45 minutes). The resulting dialysate was flash frozen in
liquid nitrogen and stored at -18°C.
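The Ellman's titration used to judge completion rests on the Beer-Lambert law, A = ε · c · l, with the extinction coefficient ε412 = 13600 M-1 cm-1 quoted above. A minimal sketch of that conversion follows; the function names, the 1 cm path length, and the absorbance and enzyme concentration in the usage lines are illustrative assumptions, not values from the procedure.

```python
EPSILON_412 = 13600.0  # M^-1 cm^-1, Ellman's chromophore (value quoted in the text)


def thiol_concentration(absorbance_412, path_length_cm=1.0, epsilon=EPSILON_412):
    """Free thiol concentration (M) from A412 via Beer-Lambert: c = A / (epsilon * l)."""
    return absorbance_412 / (epsilon * path_length_cm)


def thiols_per_enzyme(absorbance_412, enzyme_molar, path_length_cm=1.0):
    """Moles of free thiol per mole of enzyme in the cuvette."""
    return thiol_concentration(absorbance_412, path_length_cm) / enzyme_molar


# Hypothetical readings: an unmodified cysteine mutant and a fully modified CMM.
unmodified = thiols_per_enzyme(absorbance_412=0.136, enzyme_molar=1.0e-5)
modified = thiols_per_enzyme(absorbance_412=0.0, enzyme_molar=1.0e-5)
print(f"unmodified: {unmodified:.2f} thiol/enzyme, modified: {modified:.2f}")
```

A ratio near one would indicate an unreacted cysteine, while a ratio near zero corresponds to the "no free thiol present" end point described above.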



Modifying Buffer:

pH 9.5: 140 mM CHES, 2 mM CaCl2
pH 7.5: 140 mM HEPES, 2 mM CaCl2
pH 6.5: 140 mM MES, 2 mM CaCl2
pH 5.5: 140 mM MES, 2 mM CaCl2

Quench Buffer:

Reactions at pH 7.5 - 9.5: 5 mM MES, 1 mM CaCl2, pH 6.5
Reactions at pH 5.5: 5 mM MES, 1 mM CaCl2, pH 5.5



The free thiol content of all chemically modified mutant enzymes ("CMMs") was
determined spectrophotometrically by titration with Ellman's reagent (Ellman et al.,
Biochem. Pharmacol., 7:88-95 (1961), which is hereby incorporated by reference) in
phosphate buffer 0.25 M, pH 8.0. In all cases, no free thiol was detected. Modified
enzymes were analyzed by nondenaturing gradient (8-25%) gels at pH 4.2, run towards
the cathode, on the Pharmacia Phast-system and appeared as a single band. Prior to ES-MS
analysis, CMMs were purified by FPLC (BioRad, Biologic System, Hercules, CA) on
a Source 15 RPC matrix (17-0727-20 from Pharmacia, Bridgewater, NJ) with 5%
acetonitrile, 0.01% TFA as the running buffer and eluted with 80% acetonitrile, 0.01%
TFA in a one step gradient. MS m/z (ES-MS): N62C-S-a (See Figure 8) calculated
27049, found 27051; N62C-S-b (See Figure 8) calculated 26925, found 26928; N62C-S-c
(See Figure 8) calculated 26925, found 26928; N62C-S-d (See Figure 8) calculated
26925, found 26925; N62C-S-e (See Figure 8) calculated 26925, found 26925; N62C-S-f
(See Figure 8) calculated 27087, found 27087; N62C-S-g (See Figure 8) calculated
27093, found 27096; N62C-S-Et-β-Glc(Ac)2 calculated 27009, found 27015; N62C-S-Et-β-Glc(Ac)3
calculated 27051, found 27053; N62C-S-i (See Figure 8) calculated 27093,
found 27098; N62C-S-Et-β-Gal(Ac)3 calculated 27051, found 27051; N62C-S-k (See
Figure 8) calculated 27381, found 27386; L217C-S-β-Glc calculated 26882, found 26879;
L217C-S-β-Glc(Ac)2 calculated 26966, found 26962; L217C-S-β-Glc(Ac)3 calculated
27008, found 27006; L217C-S-b (See Figure 8) calculated 26926, found 26928; L217C-S-c
(See Figure 8) calculated 26926, found 26925; L217C-S-d (See Figure 8) calculated
26926, found 26925; L217C-S-e (See Figure 8) calculated 26926, found 26928; L217C-S-f
(See Figure 8) calculated 27088, found 27087; L217C-S-Et-α-Glc(Ac)2 calculated
27010, found 27012; L217C-S-Et-β-Glc(Ac)3 calculated 27052, found 27056; L217C-S-Et-α-Man(Ac)3
calculated 27052, found 27056; L217C-S-Et-β-Gal(Ac)3 calculated
27052, found 27053; L217C-S-Et-Lac(Ac)6 calculated 27340, found 27342; S166C-S-a
(See Figure 8) calculated 27076, found 27080; S166C-S-b (See Figure 8) calculated
26952, found 26955; S166C-S-c (See Figure 8) calculated 26952, found 26950; S166C-S-d
(See Figure 8) calculated 26952, found 26952; S166C-S-e (See Figure 8) calculated
26952, found 26952; S166C-S-f (See Figure 8) calculated 27114, found 27112; S166C-S-Et-α-Glc(Ac)3
calculated 27078, found 27078; S166C-S-Et-β-Glc(Ac)2 calculated 27036,
found 27040; S166C-S-Et-β-Glc(Ac)3 (major) with S166C-S-h (See Figure 8) (minor)
and S166C-S-Et-β-Glc(Ac)2 (minor) calculated 27078 (major), 27120 (minor), 27036
(minor), found 27081 (major), 27121 (minor), 27036 (minor); S166C-S-Et-α-Man(Ac)3
calculated 27078, found 27085; S166C-S-Et-β-Gal(Ac)3 calculated 27078, found 27079;
S166C-S-Et-Lac(Ac)5 calculated 27324, found 27331.

Example 3 - General procedure for modification of SBL mutants stored as
lyophilized powders

This procedure was only used with S156C, which is stored as a lyophilized
powder to prevent dimerization. Into a polypropylene test tube was weighed about 25 -
30 mg of lyophilized S156C. This was dissolved in the following modifying buffers (2.5
mL):

pH 9.5: 70 mM CHES, 2 mM CaCl2
pH 7.5: 70 mM HEPES, 2 mM CaCl2
pH 6.5: 70 mM MES, 2 mM CaCl2
pH 5.5: 70 mM MES, 2 mM CaCl2

Glyco-MTS reagent was added and the reaction then proceeded as for the other mutants,
using the appropriate quench buffer. MS m/z (ES-MS): S156C-S-a (See Figure 8)
calculated 27076, found 27079; S156C-S-b (See Figure 8) calculated 26952, found
26955; S156C-S-c (See Figure 8) calculated 26952, found 26952; S156C-S-d (See Figure
8) calculated 26952, found 26952; S156C-S-e (See Figure 8) calculated 26952, found
26952; S156C-S-f (See Figure 8) calculated 27114, found 27115; S156C-S-g (See Figure
8) calculated 27120, found 27123; S156C-S-h (See Figure 8) calculated 27120, found
27122; S156C-S-i (See Figure 8) calculated 27120, found 27123; S156C-S-j (See Figure
8) calculated 27120, found 27120; S156C-S-k (See Figure 8) calculated 27408, found
27411.

Example 4 - Contents of Acetylated Glyco-CMM Libraries 

The levels of acetylation of glyco-CMMs after modification of SBL
cysteine mutants with 1a,g-k (See Figure 8) at various pH levels were determined and are
set forth in Tables 1 and 2, below.

Table 1. Levels of Acetylation of Glyco-CMMs after Modification of SBL Cysteine
Mutants with 1a at Various pH a

Enzyme    1a, pH 9.5    1a, pH 7.5    1a, pH 5.5
N62C      4             -             4
S156C     4             -             4
S166C     4             -             4
L217C     0             2 b           3 b

a Isolated as single species unless indicated. b Single product mass by ES-MS.



WO 00/01712 



PCT/US99/15138 




Table 2. Levels of Acetylation of Glyco-CMMs after Modification of SBL Cysteine
Mutants with 1a,g-k at pH 5.5 a

Enzyme    1a     1g     1h               1i             1j     1k
N62C      4      4      3 b              4              3 b    7
S156C     4      4      4                4              4      7
S166C     4      3      3 c, 4 d, 2 d    3 b            3 b    5 b
L217C     3 b    2 b    [illegible]      [illegible]    3 b    6 b

a Isolated as single species unless indicated.
b Single product mass by ES-MS.
c Major component.
d Minor component.
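The acetylation levels tabulated above can be related to the ES-MS masses by noting that each acetyl group adds C2H2O (about 42.04 Da) to the fully deacetylated conjugate. The sketch below is an illustration of that bookkeeping rather than the procedure actually used to assign the levels; the function name is hypothetical, and the masses are the calculated and found values listed in Example 2.

```python
# Average mass added when an O-H is replaced by O-COCH3 (net gain of C2H2O).
ACETYL_INCREMENT = 2 * 12.011 + 2 * 1.008 + 15.999  # about 42.04 Da


def acetyl_count(observed_mass, deacetylated_mass):
    """Nearest whole number of acetyl groups implied by the mass difference."""
    return round((observed_mass - deacetylated_mass) / ACETYL_INCREMENT)


# S166C modified with 1h at pH 5.5: found masses from Example 2, compared with the
# calculated mass of the fully deprotected conjugate S166C-S-c (26952).
s166c_levels = [acetyl_count(found, 26952) for found in (27081, 27121, 27036)]

# L217C modified with 1a: found masses compared with calculated L217C-S-beta-Glc (26882).
l217c_levels = [acetyl_count(found, 26882) for found in (26879, 26962, 27006)]

print(s166c_levels, l217c_levels)
```

With these inputs the S166C masses correspond to three, four, and two acetyl groups, matching the major and minor components listed for 1h, and the L217C masses correspond to the 0, 2, and 3 acetyl groups given in Table 1.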

Example 5 - Incubation of L217C-S-β-Glc(Ac)3 at pH 9.5

The general procedure for modification of SBL mutants stored as flash-frozen
solutions was used to incubate 1.26 mg of L217C-S-β-Glc(Ac)3 as a 0.5 mL
aliquot in the absence of MTS reagent for 2 hours to give L217C-S-β-Glc as the sole
product. MS m/z (ES-MS): L217C-S-β-Glc calculated 26882, found 26885.

Example 6 - Active Site Titrations

The active enzyme concentration was determined as previously described
(Hsia et al., "Active-Site Titration of Serine Proteases Using a Fluoride-Ion Selective
Electrode and Sulfonyl Fluoride Inhibitors," Anal. Biochem., 242:221-227 (1996), which
is hereby incorporated by reference) by monitoring fluoride release upon enzyme reaction
with α-toluenesulfonyl fluoride (PMSF) as measured by a fluoride ion sensitive electrode
(Orion Research 96-09). The active enzyme concentration determined in this way was
used to calculate kcat values for each CMM.

Example 7 - Kinetic Measurements

Michaelis-Menten constants were measured at 25 (± 0.2)°C by curve
fitting (GraFit® 3.03, Erithacus Software Ltd., Staines, Middlesex, UK) of the initial rate
data determined at nine concentrations (0.125 mM-3.0 mM) of succinyl-AAPF-pNA
substrate in 0.1 M Tris-HCl buffer containing 0.005% Tween 80, 1% dimethylsulfoxide
("DMSO"), pH 8.6 (ε410 = 8800 M-1 cm-1) (Bonneau et al., J. Am. Chem. Soc., 119:1026-
1030 (1991), which is hereby incorporated by reference).
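To illustrate how Michaelis-Menten constants are extracted from initial-rate data, the sketch below fits synthetic data with a Hanes-Woolf linearization ([S]/v against [S]) and ordinary least squares. This is a stand-in technique chosen so that the example needs only the standard library; it is not the nonlinear curve fitting performed by GraFit. The nine substrate concentrations, the KM, and the Vmax used to generate the data are hypothetical values within the stated 0.125-3.0 mM range.

```python
def michaelis_menten(substrate, vmax, km):
    """Initial rate v = Vmax * [S] / (KM + [S])."""
    return vmax * substrate / (km + substrate)


def hanes_woolf_fit(substrates, rates):
    """Least-squares line through ([S], [S]/v); slope = 1/Vmax, intercept = KM/Vmax."""
    xs = list(substrates)
    ys = [s / v for s, v in zip(substrates, rates)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / slope
    km = intercept * vmax
    return vmax, km


# Hypothetical nine-point dilution series (mM) and noiseless synthetic rates.
concentrations_mM = [0.125, 0.25, 0.375, 0.5, 0.75, 1.0, 1.5, 2.0, 3.0]
true_vmax, true_km = 1.0, 0.73  # arbitrary rate units, mM (illustrative only)
rates = [michaelis_menten(s, true_vmax, true_km) for s in concentrations_mM]

fit_vmax, fit_km = hanes_woolf_fit(concentrations_mM, rates)
print(f"Vmax = {fit_vmax:.3f}, KM = {fit_km:.3f} mM")
```

Dividing the fitted Vmax by the active enzyme concentration from the active site titration would then give kcat. With real, noisy data a direct nonlinear fit is generally preferred, because linearizations weight experimental errors unevenly.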

Example 8 - Controlled Site Selective Glycosylation of Proteins by a Combined Site-
Directed Mutagenesis and Chemical Modification Approach

Four SBL sites at different locations and of different characteristics were
selected for mutation to cysteine in order to provide a broad test of the glycosylation
methodology. S156 of the S1 pocket (Nomenclature of Schechter; Berger, Biochem.
Biophys. Res. Commun., 27:157-162 (1967), which is hereby incorporated by reference)
is a surface-exposed residue that permits the introduction of externally-disposed glycans
mirroring those found naturally in glycoproteins (Molecular Glycobiology, Fukuda et al.,
Eds., Oxford University, Oxford (1994), which is hereby incorporated by reference). In
contrast, N62 in the S2 pocket, S166 in the S1 pocket, and L217 in the S1' pocket have side
chains which are internally oriented and test the applicability of the method for
introducing sugars at hindered locations. Broad applicability with respect to the sugar
moiety was evaluated by using the representative series of protected and deprotected,
mono- and disaccharide methanethiosulfonates ("MTS") 1a-k (see Figure 8). These were
prepared from their parent carbohydrates in good to excellent yields (Figures 6 (Reagents
and Conditions: (i) Ac2O, py then HBr, AcOH; (ii) NaSSO2CH3, EtOH, 90°C;
(iii) Br(CH2)2OH, BF3.Et2O then Ac2O, py; (iv) NaSSO2CH3, DMF, 50°C; (v) NaOMe,
MeOH; (vi) Ac2O, py then Br(CH2)2OH, BF3.Et2O, DCM) and 7 (Reagents and
Conditions: (i) Ac2O, py, 92% for 5d, 99% for 5e; Ac2O, NaOAc, 82% for 5f;
(ii) Br(CH2)2OH, BF3.Et2O, DCM, 70% for 4i, 67% for 4j, 53% for 4k; (iii) NaOMe,
MeOH, 96% for 4d, 92% for 4e, 90% for 4f; (iv) NaSSO2CH3, DMF, 50°C, 80% for 1d,
71% for 1e, 68% for 1f, 88% for 1i, 83% for 1j, 88% for 1k)). Two types of
glycosylating reagents, the anomeric methanethiosulfonate 1a and the ethyl-tethered
methanethiosulfonates 1b,c,g,h, were prepared from D-glucose (2a, Figure 6). The
preparation of these reagents in fully protected 1a,g,h and deprotected 1b,c forms allowed
the effects of increased steric bulk and hydrophobicity to be assessed. Untethered MTS
reagent 1a was readily prepared from acetobromoglucose (3) using NaSSO2CH3 as shown
in Figure 6 (Prepared from D-glucose according to Scheurer et al., J. Am. Chem. Soc.,
76:3224 (1954), which is hereby incorporated by reference). For the preparation of 1b,g,
an α-linked ethyl tether was introduced using Fischer glycosidation of D-glucose (2) with
2-bromoethanol. Treatment of the tetraacetylbromide 4g with NaSSO2CH3 allowed the
preparation of the peracetylated α-gluco-MTS 1g in an excellent 90% yield. Zemplen
deacetylation (Zemplen et al., Ber. Dtsch. Chem. Ges., 56:1705-1710 (1923), which is
hereby incorporated by reference) of bromide 4g and subsequent displacement of bromide
by methanethiosulfonate ion proceeded smoothly to yield the fully deprotected α-gluco-MTS
1b in 69% yield. The β-D-gluco-MTS reagents 1c and 1h, which are epimeric at C-1
relative to 1b and 1g, respectively, were prepared from the corresponding peracetylated
β-bromide 4h. The preparation of 4h took advantage of well-defined methodology
utilizing Lewis acid catalyzed displacement of anomeric acetates by alcohols (Dahmen et
al., "2-Bromoethyl Glycosides - Synthesis and Characterization," Carbohydr. Res.,
116:303-307 (1983), which is hereby incorporated by reference). The protected bromide
4h was elaborated to the corresponding peracetylated (1h) and deprotected (1c) β-gluco-MTS
reagents in an essentially identical manner to that used for the epimeric α-gluco-MTS
reagents. Thus, using NaSSO2CH3, 4h gave 1h in 78% yield and, following
deprotection, 4c (Helferich et al., Just. Lieb. Ann. Chem., 541:1-16 (1939), which is
hereby incorporated by reference) afforded 1c in 68% yield from 4h. Parallel routes
allowed similarly efficient access to the α-D-manno-MTS reagents 1d and 1i, which are
epimeric at C-2 relative to 1b and 1g, respectively, and the β-D-galacto-MTS reagents 1e
and 1j, epimeric at C-4 relative to 1c and 1h, respectively (Figure 7). The ready
adaptability of this method to oligosaccharides was illustrated by the preparation of the
peracetylated (1k) and fully deprotected (1f) disaccharide lacto-MTS reagents, in good
overall yields from lactose (2f) of 38% and 27% respectively without cleavage of the
interresidue bond.
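The overall yields quoted for the lactose-derived reagents are the products of the individual step yields listed in the Figure 7 conditions. A short sketch of that arithmetic follows; the function name and the list layout are illustrative, and the assignment of steps to each route is inferred from the reagents-and-conditions summary above.

```python
from math import prod


def overall_yield(step_yields_percent):
    """Overall percentage yield of a linear sequence from its step yields."""
    return 100.0 * prod(y / 100.0 for y in step_yields_percent)


# Step yields (%) quoted for Figure 7:
# peracetylation to 5f, glycosidation to 4k, then thiosulfonate displacement to 1k.
route_1k = [82, 53, 88]
# peracetylation to 5f, glycosidation to 4k, deacetylation to 4f, displacement to 1f.
route_1f = [82, 53, 90, 68]

yield_1k = overall_yield(route_1k)
yield_1f = overall_yield(route_1f)
print(f"1k: {yield_1k:.0f}% overall, 1f: {yield_1f:.0f}% overall")
```

Rounded to whole percentages, the products agree with the 38% and 27% overall yields stated for 1k and 1f.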
Example 9 - Site Specific Glycosylation.

The glyco-MTS reagents 1a-k (See Figure 8) were reacted with the chosen
cysteine mutants SBL-N62C, -S156C, -S166C and -L217C in aqueous buffer under
conditions described previously (Stabile et al., Bioorg. Med. Chem. Lett., 6:2501-2512
(1996); Berglund et al., J. Am. Chem. Soc., 119:5265-5266 (1997); DeSantis et al.,
Biochem., 37:5968-5973 (1998), which are hereby incorporated by reference). These
reactions were rapid and quantitative, as judged by monitoring of changes in specific
activity and by titration of free thiols with Ellman's reagent (Ellman et al., Biochem.
Pharmacol., 7:88-95 (1961), which is hereby incorporated by reference). The
glycosylated chemically modified mutants (CMMs) were purified by size-exclusion
chromatography and dialysis, and their structures were confirmed by rigorous ES-MS
analyses (± 7 Da) as shown in Table 3 below:

Table 3. Properties and Kinetic Parameters a of Modified Enzymes

Reacn. pH 




29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 



la 
lb 
Ic 
Id 
le 
If 
1° 
lh 
lh 
li 

U 
Ik 



9.5 
6.5 
6.5 
6.5 
6.5 
6.5 
5.5 
6.5 
5.5 
5.5 
5.5 
5.5 



N62C-SV 

N62C-S-b b 

N62C-S-c b 

N62C-S-d b 

N62C-S-c b 

N62C-S-P 

N62C-S-g b 
N62C-S-Et-P-Glc(Ac) : c 
N62C-S-El-p-Glc(Ac), c 

N62C-S-i b 
N62C-S-El-|Kjal<Ac) ? c 

N62C-S-k b 



174 ±9 
67.9 ±3.5 
135.3 ±3.5 

132.7 ±4.0 
132.9±3.1 
119.3 ±3.6 

129.8 ±2.4 
1 20.0 ±2.7 
87.7 ±4.2 

1 00.3 ±3.5 
123.0± 1.6 

103.4 ±4.3 
64.9± 1.5 



1.90 ±0.20 
0.52 ± 0.07 
0.94 ± 0.05 
1.25 ±0.08 
1 .04 ± 0.05 
0.99 ±0.07 

1.04 ±0.04 
0.52 ± 0.03 
1.63± 0.15 
1.86 ±0.12 

1 .05 ± 0.03 
2.36 ±0.17 
0.88 ± 0.05 



15 L217C 

16 

17 

IS 

19 

20 

21 

22 

23 

24 

25 

26 

27 



la 
la 
la 
lb 
lc 
Id 
lc 
If 

lg 
lb 
li 

Ik 



9.5 
7.5 
5.5 
6.5 
6.5 
6.5 
6.5 
6.5 
5.5 
5.5 
5.5 
5.5 
5.5 



41 ± I 

L2I7C-S-P-GIC" 27.7 ±0.4 

U 1 7C-S-jK5!c(Ac) 2 c 44.9 ± 2.0 

L2 1 7C-S-fK31c( Ac) : , c 36.3 ± 0.8 

L217C-S-b b 57.8 ±0.6 

L217C-S-c b 50.6 ±0.9 

L2l7C-S-d b 62.0 ±1.3 

L217C-S-e b 46.2 ±0.8 

L217C-S-f 30.4 ±0 6 

L2 1 7C-S-Et-a-G1c< Ac) 2 e 72.7 ± 3. 1 

L2 1 7C-S-Et-P-Glc(Ac) : , c 29.4 ± 0.8 

L2 1 7C-S-Et-a-Man(Ac) 5 c 97.8 ± 2.4 

L2 1 7C-S-Et-&-Gal(Ac),< 39.2 ± 0.8 

L2 1 7C-S-Et-Lac(AcV 27 . 1 ± 0.6 



0.80 ± 0.04 
0.79 ± 0.03 
0.44 ± 0.06 
0.36 ± 0.03 
0.67 ± 0.02 
0.67 ± 0.03 
0.55 ± 0.03 
0.63 ± 0.08 
0.46 ± 0.03 
0.73 ± 0.08 
0.93 ± 0 06 
0.59 ± 0.04 
1.17 ±0.05 
0.69 ± 0.04 



130.6 ± 18.8 
143.9 ±8.5 
106.2 ±7.5 
127.8 ± 6.8 
1 20.5 ± 9.3 
124.8 ±5.3 
230.8 ± 14.3 

53.8 ± 5.8 

53.9 ±4.0 
117.1 ±3.7 
43.8 ±3.6 
73.8 ±4.5 
51 ±3 



35.1 ± » 4 
102.0± 14.6 
100.8 ±8.7 
86.3 ± 2.7 

75.5 ± 3.6 

112.7 ±6.6 
73.3 ±3.7 
66.1 ±4.5 

99.6 ± 11.8 
316±2.2 

165.8 ± 12.0 
33.5 ± 1.6 
39.3 ± 2.4 



S156C S t 



la 
lb 
lc 
Id 
le 
If 

lg 
lb 
li 

lj 
Ik 



9.5 
6.5 
6.5 
6.5 
6.5 
6.5 
5.5 
5.5 
5.5 
5.5 
5.5 



Sl56C-S-a b 
S156C-S-b b 
S156C-SV 
S156C-S-0* 
SI56C-S-* 6 
S156C-S-f 
SI 560-5^ 
S156C-S-h b 
S156C-S~i b 
S156C-S-j b 
SI56C-S-k b 



41 

42 
43 
44 
45 
46 
47 
48 
49 

50 

51 

52 
53 



S166C S, 



la 
lb 
lc 
Id 
le 
If 



lh 



lh 



lk 



95 S166C-S-a b 

6 s S166C-S-b b 

6 S S166C-S-c b 

6 S S166C-S-<r b 

6 s Sl66C-S-e b 

6.5 SlooXT-S-f 

5.5 SI 66C-S-Et-a-Glc( Ac>;. c 

6.5 S 1 66C-S-Et-f*-G1c(Ac) 2 e 

SI66C-S-Et-^GMAc) : . d 

5 .5 SI 66C-S-Et-£-Glc(Ac)/ 

Sl66C-S-h c 

5 S Sl66C-S-Et-a«Man<Ac) 5 c 

5.5 SI 66C-S-Et-p-Gal(Ac) 5 < , 

5.5 S^PS- El-LacCAc)?' ' 



I25±4 

54.8 ± 1.3 
77.0 ± 1.2 
76.6 ± 1.7 
88.6 ±2.8 

78.9 ± 1.9 
63.6 ± 1.4 
43 6 ± 0.8 
64.0 ± 1.3 
60.3 ± 0.9 
5I.9±0.6 

53.6 ±0.8 
42 ± 1 

33.8 ± 13 
81.9 ± I.I 
67.0 ± 2.2 
76.5 ± 2.0 
62.2 ±1.4 
58.2 ± \2 
3I.0±0.8 

95.0 ±2. 1 

72.9 ±1.7 

67.7 ± 1.9 

65.1 ±0.9 
67.4 ± 1.6 



0.85 ± 0.06 
0.70 ± 0.04 
0.84 ±0.03 
0.73 ± 0.04 
0.79±0.06 
0.89 ± 0.04 
0.89 ± 0.05 
0.78 ± 0.04 
0.72 ± 0.04 
0.71 ±0.03 
0.61 ±0.02 
Q.79±0.03 
0.50 ± 0.05 
0.66 ± 0.06 
1.14 ±0.03 
0.99 ±0.07 
1.17 ±0.07 
1.08 ±0.05 
1.02 ±0.04 
0.77 ± 0.05 
0.87 ± 0.05 

0.65 ±0.04 

1.64 ±0.09 
0.80 ±0.03 

1.65 ±0.07 



147 ± 11 

78.3 ± 4.8 
91.7 ±3.6 
104.9 ±6.2 
112.2 ±9.2 

89.7 ±4.4 

71.8 ±4.3 

55.9 ±3.0 
88.9 ±5.2 
84.9 ±3.8 
85.1 ±3.0 

67.4 ± 2.8 
84 ±9 



51.2 ±5.0 
7I.8±2.I 
67.6 ±5.3 
65.4 ± 43 
57.6 ±3.0 
57.1±2.5 

40.3 ± 2.8 
109.2 ±6.7 

1 12.2 ±7.4 

41.3 ±2.5 
813*33 
40.8 ± 2.0 



» Michaelis-Menten constants were me asured at 25 "L according to the initial 
0.005% Twccn 80 T 1% DMSCX suc-AAPF-/>NA as the substrate. 
b Single species. 

c Single product mass by ES-MS. 
6 Major component. 
e Minor component. 



Tates method in 0.1 M Tris-HCI buffer at pH B.6. 
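
The tabulated kcat/KM entries can be cross-checked against the kcat and KM columns. The following is a minimal sketch in Python, not part of the original disclosure. It assumes that the uncertainties in kcat and KM are independent and combine in quadrature; the source does not state how the reported errors were derived, and the function name is hypothetical.

```python
import math

def specificity_constant(kcat, kcat_err, km, km_err):
    """Return (kcat/KM, propagated error).

    Assumption: kcat and KM errors are independent, so their relative
    errors add in quadrature. The source does not confirm this.
    """
    ratio = kcat / km
    rel_err = math.sqrt((kcat_err / kcat) ** 2 + (km_err / km) ** 2)
    return ratio, ratio * rel_err

# Values tabulated for N62C-S-a: kcat = 67.9 +/- 3.5 s^-1, KM = 0.52 +/- 0.07 mM
value, error = specificity_constant(67.9, 3.5, 0.52, 0.07)
print(f"kcat/KM = {value:.1f} +/- {error:.1f} s^-1 mM^-1")
```

For this row the sketch reproduces the tabulated 130.6 ± 18.8 s^-1 mM^-1, which suggests, but does not prove, that quadrature propagation was used.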

The CMMs each appeared as a single band on non-denaturing gradient PAGE, thereby
establishing their high purities. The active enzyme concentration of the resulting CMM
solutions was determined by active site titration with α-toluenesulfonyl fluoride (PMSF)
using a fluoride ion-sensitive electrode (Hsia et al., "Active-Site Titration of Serine
Proteases Using a Fluoride-Ion Selective Electrode and Sulfonyl Fluoride Inhibitors,"
Anal. Biochem., 242:221-227 (1996), which is hereby incorporated by reference). In all
cases, modification with the fully deprotected reagents 1b-f (See Figure 8) led to
site-specific glycosylations and the formation of single glycoforms. Furthermore,
modification with the protected MTS reagents 1a,g-k (See Figure 8) gave products with
controllable levels of acetylation. Through adjustment of pH and appropriate selection of
the glycosylation site, differently acetylated glycoforms of SBL were prepared. This
ability to modulate the level of acetylation through pH control vastly expands the
structural variety of glyco-CMMs that can be conveniently accessed, and its scope was
probed through the reaction of 1a (See Figure 8) with SBL-N62C, -S156C, -S166C, and
-L217C (Figure 9).

The extent of deacetylation during modification was highly site-dependent.
Modification of L217C with reagent 1a (See Figure 8) at pH 9.5 was accompanied by
complete in situ deacetylation, and the sole product was the fully deprotected
glucosylated SBL, L217C-S-β-Glc. In contrast, treatment of N62C, S156C, and S166C
with 1a (See Figure 8) at pH 9.5 yielded only fully acetylated products, N62C-S-a,
S156C-S-a, and S166C-S-a (See Figure 8), respectively. To
examine the effects of pH upon deacetylation, the reaction of L217C with 1a (See Figure
8) was chosen. At pH 7.5 and 5.5, the products retained two and three acetate groups,
forming L217C-S-β-Glc(Ac)2 and L217C-S-β-Glc(Ac)3, respectively. In all cases,
complete integrity of the site selectivity was retained.

This valuable site-dependent deacetylation was attributed to a novel
intramolecular SBL-catalyzed process. Although acetate esters are moderately
chemically labile in aqueous solution at pH 9.5, they are not at either pH 7.5 or 5.5
(Greene et al., Protective Groups in Organic Synthesis, 2nd ed. New York, Wiley (1991),
which is hereby incorporated by reference). The striking differences in behavior during
modification between L217C and the three other mutants N62C, S156C, and S166C
under identical reaction conditions discounted the possibility of deacetylation prior to
modification. In addition, it was noted that position 217 bears an internally-oriented side
chain and that modification of surface exposed position 156 showed no sign of
deacetylation. This observation discounted the possibility of either in situ chemical
deacetylation or intermolecular enzymatic deacetylation. Furthermore, this ability of SBL
to intramolecularly deacetylate was confirmed by the reaction of L217C-S-β-Glc(Ac)3 at
pH 9.5. Incubation of L217C-S-β-Glc(Ac)3 under standard modification reaction
conditions, but without reagent 1a (See Figure 8), gave L217C-S-β-Glc as the sole
product (Figure 9).

The enormous potential of this method was demonstrated by the
preparation of a small library of differently acetylated glycosylated CMMs through the
reaction of SBL-N62C, -S156C, -S166C, and -L217C with MTS reagents 1g-k (See
Figure 8). Using the pH-activity profiles of wild-type ("WT") and CMMs of SBL as a
guide, pH 5.5 and 6.5 were chosen to minimize deacetylation. Typically, the specific
activity of SBL and its CMMs drops sharply below pH 7.5 to levels that at pH 5.5 are
5-20% those at optimal pH (8.5-9.5) (DeSantis et al., "Chemical Modifications at a Single
Site Can Induce Significant Shifts in the pH Profiles of a Serine Protease," J. Am. Chem.
Soc., 120:8582-8586 (1998), which is hereby incorporated by reference). As expected,
this drop in hydrolytic activity was reflected in the products of these modifications with
1g-k (See Figure 8), which in all cases retained two or more acetate groups.

For example, at pH 5.5 reactions of SBL-L217C and -S166C created
singly deacetylated CMMs, with the exception of pure dideacetylated glyco-CMMs
L217C-S-Et-α-Glc(Ac)2 and S166C-S-Et-Lac(Ac)5. The formation of the latter may reflect
the presence of two primary acetates in disaccharidic MTS reagent 1k (See Figure 8).
Primary acetate groups are typically more labile than secondary acetate groups under
conditions of intermolecular enzymatic deacetylation (Bashir et al., "Enzymatic
Esterification and De-Esterification of Carbohydrates - Synthesis of a Naturally-
Occurring Rhamnopyranoside of p-Hydroxybenzaldehyde and a Systematic Investigation
of Lipase-Catalyzed Acylation of Selected Arylpyranosides," J. Chem. Soc., Perkin
Trans. 1, 2203-2222 (1995), which is hereby incorporated by reference). In contrast,
reactions of SBL-S156C gave only the fully acetylated CMMs, S156C-S-g-k (See Figure
8). This uniform lack of deacetylation observed for surface exposed glycans at position
156 was consistent with an intramolecular enzyme-catalyzed mechanism requiring
internally-oriented acetate groups. Interestingly, the reactions of SBL-N62C were also
determined by the anomeric configuration of 1g-k (See Figure 8), with α-MTS reagents
1g,i (See Figure 8) giving products that retained all acetate groups while the products of β-MTS reagents
1h,j,k (See Figure 8) were monodeacetylated. The range of accessible acetylated glyco-
CMMs was further extended through modification at pH 6.5. For example, at position 62,
it allowed the introduction of diacetylated β-glucose, forming N62C-S-Et-β-Glc(Ac)2, in
place of the triacetylated N62C-S-Et-β-Glc(Ac)3 formed at pH 5.5. The range of
acetylation at position 166 was similarly expanded through the formation of
S166C-S-Et-β-Glc(Ac)2 in place of S166C-S-Et-β-Glc(Ac)3.

Example 10 - Glycan Structure-Hydrolytic Activity Relationships

The effects of glycosylation upon SBL were assessed by the determination
of kcat and KM for the hydrolysis of succinyl-AAPF-p-nitroanilide (Suc-AAPF-pNA) at
pH 8.6. The kinetic parameters of the 48 CMMs generated were compared with those of
WT and unmodified mutants in Table 3. The highly selective and controlled method
shown in Figure 8 allowed the introduction of structurally related monosaccharides,
D-glucose, D-galactose, and D-mannose, in addition to the more sterically bulky
disaccharide lactose. From the resulting glycosylated CMMs, a detailed and precise set of
structure-activity relationships was generated (Figures 10-12).

At position 62, in the S2 pocket, the 2.3-fold reduction in kcat/KM caused by
mutation to cysteine was partially restored by glycosylation (Figure 10A). The
introduction of ethyl-tethered α- or β-glucose, β-galactose or α-mannose to N62C
increased kcat/KM and formed N62C-S-b-e (See Figure 8) with kcat/KM 1.5- to 2-fold lower
than WT. Despite its steric bulk and high hydrophilicity, disaccharidic lacto-CMM
N62C-S-f (See Figure 8) also showed higher activity than N62C, with kcat/KM only 1.7-fold
lower than WT.

The effects of the mutation of position 217 in the S1' pocket were
intrinsically more dramatic, as indicated by a value of kcat/KM for L217C that is 4-fold
lower than WT (Figure 10B). The introduction of deprotected untethered glucose,
forming L217C-S-β-Glc, lowered kcat/KM further to 6-fold lower than WT. In contrast,
glycosylation of position 217 with ethyl tethered MTS reagents 1b-f (See Figure 8)
restored activity, and kcat/KM values for L217C-S-b-f (See Figure 8) were similar to each other in
the range 2.5- to 3.1-fold lower than WT. This striking difference between tethered
L217C-S-b-f (See Figure 8) and untethered L217C-S-β-Glc illustrated that SBL tolerates
the replacement of hydrophobic Leu with highly hydrophilic carbohydrate moieties better when
they are linked by a hydrophobic ethyl spacer group than when directly linked as Cys-S-β-Glc.
This may indicate that a structural requirement for efficient amidase activity is a
closely-bound hydrophobic residue in the S1' subsite of SBL and contrasts sharply with
the excellent enhancement of esterase activity caused by the same Cys-S-β-Glc
substitution.

Mutation of position 156 in the S1 pocket to cysteine caused a 1.4-fold
drop in kcat/KM (Figure 10C). Subsequent introduction of deprotected S-Et-α-Glc, side
chain b (See Figure 8), resulted in a kcat/KM for S156C-S-b (See Figure 8) that was 2.3-
fold lower than WT. From S156C-S-b to -f (See Figure 8), kcat/KM varied in an arced
manner, peaking at 1.9-fold lower than WT for S156C-S-d (See Figure 8) and then
decreasing monotonically to a kcat/KM for S156C-S-f (See Figure 8) that was 3-fold lower
than WT. The similar KM values for these S156C CMMs to those of SBL-WT were
indicative of these modifications having little effect upon ground state binding and were
consistent with the surface exposed orientation of the S156 side chain.

At position 166, in the S1 pocket, the 2.5-fold decrease in kcat/KM caused by
mutation to cysteine was amplified by modification with 1b (See Figure 8) and led to a
kcat/KM value 3-fold lower than WT for S166C-S-b (See Figures 8 and 10D). From
S166C-S-b to -f (See Figure 8), kcat/KM decreased monotonically to a kcat/KM for S166C-S-f
(See Figure 8), in which the S1 binding site was occupied by the sterically bulky
disaccharide lactose, that was 3.8-fold lower than WT.

Example 11 - Kinetic Effects of Glycosylation with Acetylated Carbohydrates.

The enormous potential of the controlled site-selective glycosylation
approach depicted in Figure 8 was illustrated by the great variety of changes in kcat/KM that
were caused by the introduction of acetylated side chains a,g-k (See Figure 8) to SBL.
These dramatic changes contrast with the slight variations found for deprotected side
chains b-f (See Figure 8). For example, at position 62, an alternating decrease-increase
pattern was observed (Figure 11A). This resulted in a kcat/KM for tetraacetylated α-gluco-
CMM N62C-S-g (See Figure 8) that was 1.1-fold higher than WT. Similar alternating
patterns were also seen at positions 217 (Figure 11B) and 166 (Figure 11D). At position
156, variations were slight, which was consistent with its surface exposed orientation
(Figure 11C).

To examine the cause of these variations, the kcat/KM values of acetylated
glycosylated CMMs were compared with those for deprotected glycosylated CMMs with
the same glycan structure and stereochemistry (Figure 12). This separated the effects of
acetylation from the effects of glycosylation and allowed the underlying effects of
modification to be dissected. It was clear from Figure 12 that the anomeric
stereochemistry of the acetylated glycans modulates kcat/KM.
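
The comparison described above amounts to dividing the kcat/KM of an acetylated CMM by that of its deprotected counterpart. The following is a minimal illustrative sketch in Python, not part of the original disclosure. The function name is hypothetical, and the numerical inputs are the position 62 values listed in Table 3.

```python
def acetylation_effect(acetylated, deprotected):
    """Ratio of kcat/KM (acetylated CMM) to kcat/KM (deprotected CMM).

    A value above 1 means acetylation raised kcat/KM; below 1, lowered it.
    """
    return acetylated / deprotected

# kcat/KM values in s^-1 mM^-1 as listed in Table 3 for position 62
alpha_gluco = acetylation_effect(230.8, 143.9)  # N62C-S-g vs N62C-S-b
beta_gluco = acetylation_effect(53.9, 106.2)    # N62C-S-Et-β-Glc(Ac)3 vs N62C-S-c

print(f"alpha-gluco: {alpha_gluco:.1f}-fold change")
print(f"beta-gluco: {1 / beta_gluco:.1f}-fold decrease")
```

With these inputs the α-gluco ratio is about 1.6 and the β-gluco ratio corresponds to roughly a 2-fold decrease, in line with the opposite anomeric trends described for position 62.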

For example, at position 62 (Figure 12A) comparison of N62C-S-b,c (See
Figure 8) with N62C-S-g (See Figure 8) and N62C-S-Et-β-Glc(Ac)2,3 showed that
increasing the number of acetate groups from zero to four, from N62C-S-b (See Figure 8)
to N62C-S-g (See Figure 8), increased kcat/KM 1.6-fold for the α-gluco side-chain b (See
Figure 8). In contrast, increasing the number from zero to two or three, from N62C-S-c to
N62C-S-Et-β-Glc(Ac)2 or 3, was detrimental for the β-gluco side-chain c (See Figure 8) and
led to a 2-fold decrease. Similarly, N62C-S-Et-β-Gal(Ac)3 displayed a distinctly lower
kcat/KM than N62C-S-e (See Figure 8) that was 5-fold lower than WT. These changes in
kcat/KM upon acetylation were manifested largely through increased or decreased ground
state binding, of which the most striking example was a KM for N62C-S-Et-β-Gal(Ac)3
that was 2.4-fold higher than that of deprotected galacto-CMM N62C-S-e (See Figure 8).

At position 217, control of the level of acetylation through pH, as shown in
Figure 9, had allowed the introduction of untethered β-D-glucose bearing
zero, two, and three acetate groups. As Figure 12B illustrates, the addition of two or three
acetate groups restored kcat/KM from 6-fold lower than WT for L217C-S-β-Glc to 2-fold
lower than WT for L217C-S-β-Glc(Ac)2 or L217C-S-β-Glc(Ac)3. This showed that
acetylation allowed fine-tuning of activity and paralleled increases in the esterase kcat/KM
of these CMMs.

The same trend in kcat/KM was observed for the L217C ethyl tethered
CMMs as at position 62: acetylation was beneficial to α-tethered but detrimental to
β-tethered CMMs. For example, increasing the number of acetate groups in the α-linked
glucose moiety from zero to two, i.e., from L217C-S-b (See Figure 8) to
L217C-S-Et-α-Glc(Ac)2, increased kcat/KM to 2-fold lower than WT. In contrast, increasing the number of
acetate groups in the epimeric β-linked moiety from zero to three, i.e., from L217C-S-c
(See Figure 8) to L217C-S-Et-β-Glc(Ac)3, halved kcat/KM to 6-fold lower than WT.
Similarly, the kcat/KM of α-linked L217C-S-Et-α-Man(Ac)3 was 1.5-fold higher than the
corresponding deprotected L217C-S-d (See Figure 8), while that of β-linked
L217C-S-Et-β-Gal(Ac)3 was 2-fold lower than the corresponding deprotected L217C-S-e (See Figure 8).

Consistent with its surface exposed orientation, the changes at position 156
caused by acetylation were slight and no variation with anomeric stereochemistry was
seen (Figure 12C). Increasing the number of acetate groups from zero to four, from
S156C-S-b-e (See Figure 8) to S156C-S-g-j (See Figure 8), decreased kcat/KM by 1.05- to
1.6-fold. The most sterically bulky lactose side chain (-k) (See Figure 8) gave rise to the
lowest kcat/KM at position 156, 3.2-fold lower than WT, and indicated that even at the
surface of SBL the introduction of sterically bulky groups still allowed tailoring of

At position 166, the effects of increased acetylation were, as at positions
62 and 217, modulated by glycan anomeric configuration (Figure 12D). However, the
direction of these increases and decreases was reversed: acetylation was beneficial to
β-tethered but detrimental to α-tethered CMMs. For example, the α-tethered
S166C-S-Et-α-Glc(Ac)3 had a 1.8-fold lower kcat/KM value than the corresponding deprotected
S166C-S-b (See Figure 8), while β-linked CMMs S166C-S-Et-β-Glc(Ac)2,3 had 1.6-fold higher
kcat/KM values than the corresponding fully deprotected S166C-S-c (See Figure 8). Again,
these variations were largely manifested through changes in ground state binding. For
example, KM increased 1.4-fold from S166C-S-Et-α-Man (-d) (See Figure 8) to
S166C-S-Et-α-Man(Ac)3.

It should be noted that the changes in activity of the lacto-CMMs upon
acetylation fell largely outside of these trends, and at all four positions acetylation of the
bulky, disaccharidic side-chain f (See Figure 8) caused a general decrease in kcat/KM. For
example, at position 62 heptaacetylation, from N62C-S-f (See Figure 8) to N62C-S-k
(See Figure 8), resulted in a lowering of kcat/KM to a value that is 3-fold lower than WT.
Despite the greater steric bulk of side chain k (See Figure 8), this drop was a consequence
of a lower kcat, 2-fold lower than N62C-S-f (See Figure 8), rather than a higher KM. In fact,
the KM value of N62C-S-k (See Figure 8) was 1.3-fold lower than that of N62C-S-f (See Figure
8). Similarly, L217C-S-Et-Lac(Ac)6 and S166C-S-Et-Lac(Ac)5 had 1.7-fold and 1.4-fold
lower kcat/KM values than the corresponding deprotected CMMs, respectively.

In summary, the strategy of site-directed mutagenesis combined with
chemical modification was exploited for the site-selective glycosylation of SBL. This
method was general and versatile, and allowed the preparation of pure glycoforms which
constitute the first examples of regio- and glycan-specific protein glycosylation at
predetermined sites. Careful control of a novel SBL-catalyzed intramolecular
deacetylation greatly expanded the scope of this method and, through reaction of SBL-
N62C, -S156C, -S166C, and -L217C with peracetylated MTS reagents 1a,g-k (See Figure
8), allowed the introduction of glycans with precisely modulated levels of acetylation.

The glycosylated CMMs formed display kcat/KM values that ranged from
1.1-fold higher than WT to 7-fold lower than WT. Without the use of this highly
selective glycosylation technique, the determination of such precise trends would be
unachievable and variations caused by previous non-specific glycosylation could only be
interpreted in a general manner. It has been demonstrated that subtle differences in
carbohydrate structure may be used to fine tune the activity of SBL. For example, the
anomeric stereochemistry of the glycans introduced modulated changes in kcat/KM upon
acetylation. At positions 62 and 217, acetylation enhanced the activity of α-tethered
CMMs but decreased that of β-tethered CMMs. This trend was reversed at position 166 where, in
contrast, acetylation enhanced the kcat/KM of β-tethered CMMs but decreased those of
α-tethered CMMs. Consistent with its surface exposed nature, changes at position 156 were more
modest, but still allowed control of activity, particularly through glycosylation with
the disaccharide lactose. These results illustrated the great potential for tailoring activity
through the correct choice of glycan and glycosylation site.

The ability of the glycosylation method to glycosylate the binding pockets
of SBL also creates opportunities to broaden its substrate specificity. For instance, an
array of hydrogen bonding hydroxyl groups may broaden its specificity towards hydrogen
bonding substrates such as glycosylated amino acids. Subtilisins have been elegantly
used to catalyze the synthesis of glycopeptides (Witte et al., "Solution- and Solid-Phase
Synthesis of N-protected Glycopeptide Esters of the Benzyl Type as Substrates for
Subtilisin-Catalyzed Glycopeptide Couplings," J. Am. Chem. Soc., 120:1979-1989
(1998); Wong et al., "Enzymatic-Synthesis of N-Linked and O-Linked Glycopeptides,"
J. Am. Chem. Soc., 115:5893-5901 (1993), which are hereby incorporated by reference).
However, the natural specificity of these enzymes has limited these peptide ligations to
those in which the glycosylated residues are at least one residue distant (P2, P3... or
P2', P3'...) from the amide bond formed. For example, while ligation of Z-Gly-OBz with
H-Gly-Ser(Ac3GlcNAcβ)-NH2 was successful, no yield of product was obtained with
H-Ser(Ac3GlcNAcβ)-NH2 (Witte et al., J. Am. Chem. Soc., 120:1979-1989 (1998), which is
hereby incorporated by reference). The introduction of sugars to the S1 and S1' subsites as
hydrogen bonding groups demonstrated here may enhance the specificity of proteases
towards hydrophilic substrates.

Furthermore, by choosing carbohydrate attachments that differ from each
other at only one stereocenter, SARs may be determined by examining changes in activity
as the nature of the sugar side chain is varied. For example, the effect of inverting
stereocenters in the order C-4→C-1→C-2 can be determined using CMMs in the series
e → c → b → d (See Figure 8). While the current illustrations have been with SBL as a
protein example, the method is clearly amenable to the glycosylation of any protein and is
without limitation with respect to the sites and to the glycans that may be conjugated. It
will, therefore, allow the introduction of any therapeutically important carbohydrate
recognition determinant, of which the β-D-galactopyranosyl moiety of e and f (See Figure
8) that represents a ligand of the hepatic asialoglycoprotein receptor (Sharon et al., Essays
Biochem., 30:59-75 (1995), which is hereby incorporated by reference) is just one
example.

Example 12 - Esterase Screen 

Specificity constants determined using the low substrate approximation
were measured indirectly using Ellman's reagent (Ellman et al., Biochem. Pharmacol.,
7:88-95 (1961), which is hereby incorporated by reference) (ε412 = 13600 M^-1 cm^-1) using
0.15 and 0.30 mM succinyl-AAPF-SBn as substrate in 0.1 M Tris.HCl, containing 0.005
vol% DMSO, 1 vol% 37.5 mM Ellman's reagent in DMSO, pH 8.6.
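
The indirect measurement converts an absorbance slope at 412 nm into a rate of thiol release and then, under the low substrate approximation, into kcat/KM. The following is a minimal sketch in Python, not part of the original disclosure. The 1 cm path length, the absorbance slope, and the enzyme and substrate concentrations are illustrative assumptions rather than values from the source; only the extinction coefficient is taken from the text.

```python
EPSILON_412 = 13600.0  # M^-1 cm^-1, extinction coefficient quoted in the text

def thiol_release_rate(absorbance_slope, path_cm=1.0):
    """Convert dA412/dt (absorbance units per second) into M s^-1 (Beer-Lambert).

    path_cm = 1.0 is an assumed cuvette path length.
    """
    return absorbance_slope / (EPSILON_412 * path_cm)

def kcat_over_km(rate, enzyme_molar, substrate_molar):
    """Low substrate approximation: v = (kcat/KM)[E][S], valid when [S] << KM."""
    return rate / (enzyme_molar * substrate_molar)

# Hypothetical inputs chosen only to illustrate the arithmetic
rate = thiol_release_rate(absorbance_slope=1.36e-3)
specificity = kcat_over_km(rate, enzyme_molar=1e-9, substrate_molar=3e-5)

print(f"rate = {rate:.2e} M s^-1")
print(f"kcat/KM = {specificity / 1000:.0f} s^-1 mM^-1")
```

Dividing the result in M^-1 s^-1 by 1000 expresses it in s^-1 mM^-1, the unit used for the screen results.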

Example 13 - Full Esterase Kinetics Measurements

Michaelis-Menten constants were measured at 25 °C by curve fitting
(Grafit® 3.03, Erithacus Software Ltd., Staines, Middlesex, UK) of the initial rate data
determined at eight concentrations (31.25 µM - 3 mM) of the succinyl-AAPF-SBn
substrate, followed indirectly using Ellman's reagent in 0.1 M Tris.HCl, containing 0.005
vol% DMSO, 1 vol% 37.5 mM Ellman's reagent in DMSO, pH 8.6.
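
The fitting step can be illustrated without the commercial package. The following Python sketch is not the Grafit procedure used here: it swaps in a linearized Hanes-Woolf least-squares fit ([S]/v against [S]) as a simple stand-in for nonlinear curve fitting. The concentration spacing and the Vmax and KM used to generate the synthetic rates are hypothetical.

```python
def hanes_woolf_fit(substrate, rate):
    """Fit [S]/v = [S]/Vmax + KM/Vmax by ordinary least squares.

    Returns (Vmax, KM). This is a linearization, not the nonlinear fit
    described in the text.
    """
    y = [s / v for s, v in zip(substrate, rate)]
    n = len(substrate)
    mean_x = sum(substrate) / n
    mean_y = sum(y) / n
    sxx = sum((x - mean_x) ** 2 for x in substrate)
    sxy = sum((x - mean_x) * (yi - mean_y) for x, yi in zip(substrate, y))
    slope = sxy / sxx                      # equals 1 / Vmax
    intercept = mean_y - slope * mean_x    # equals KM / Vmax
    vmax = 1.0 / slope
    return vmax, intercept * vmax

# Eight concentrations spanning 31.25 uM to 3 mM (spacing is an assumption), in mM
conc_mM = [0.03125, 0.0625, 0.125, 0.25, 0.5, 1.0, 2.0, 3.0]
true_vmax, true_km = 2.0, 0.4  # hypothetical parameters for synthetic data
rates = [true_vmax * s / (true_km + s) for s in conc_mM]

vmax, km = hanes_woolf_fit(conc_mM, rates)
print(f"Vmax = {vmax:.3f}, KM = {km:.3f} mM")
```

On noise-free synthetic data the linearized fit recovers the generating parameters; with real data a weighted nonlinear fit is generally preferable because linearization distorts the error structure.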

Example 14 - Esterase Activity Screen 

The glyco-CMMs shown in Table 4 were prepared by reacting reagents
1a-k (See Figure 8) with the chosen cysteine mutants SBL-N62C, -S156C, -S166C and
-L217C, and were purified and extensively characterized as described previously.

Table 4.

Esterase Screen Results for Glycosylated CMMs (a)

                       N62C-R                    L217C-R                   S166C-R                   S156C-R
-R                     kcat/KM      E/A (b)      kcat/KM      E/A (b)      kcat/KM      E/A (b)      kcat/KM      E/A (b)
                       (s^-1 mM^-1)              (s^-1 mM^-1)              (s^-1 mM^-1)              (s^-1 mM^-1)

-H                     4380         48           5540         109          350          4            [illegible]  [illegible]
-SβGlc                 -            -            6350         186          -            -            -            -
-b                     [illegible]  37           7773         90           2688         37           1984         22
-c                     6625         62           9791         130          2732         40           2197         21
-d                     [illegible]  36           8651         77           2924         45           2648         24
-e                     5729         48           11923        163          3241         56           2505         28
-f                     5990         48           9080         137          2511         44           1692         24
-SβGlc(Ac)2            -            -            8776         86           -            -            -            -
-SβGlc(Ac)3            -            -            11642        115          -            -            -            -
-SβGlc(Ac)4 (-a)       [illegible]  [illegible]  -            -            2537         50           1356         17
-SEtαGlc(Ac)2          -            -            7893         79           -            -            -            -
-SEtαGlc(Ac)3          -            -            -            -            1829         45           -            -
-SEtαGlc(Ac)4 (-g)     [illegible]  [illegible]  -            -            -            -            1887         34
-SEtβGlc(Ac)2          1925         36           -            -            5058         46           -            -
-SEtβGlc(Ac)3          2736         51           4935         156          6033         54           -            -
-SEtβGlc(Ac)4 (-h)     -            -            -            -            -            -            2321         26
-SEtαMan(Ac)3          -            -            11751        71           1537         37           -            -
-SEtαMan(Ac)4 (-i)     5807         50           -            -            -            -            2629         31
-SEtβGal(Ac)3          2169         50           3889         116          3159         38           -            -
-SEtβGal(Ac)4 (-j)     -            -            -            -            -            -            2010         24
-SEtLac(Ac)5           -            -            -            -            1543         38           -            -
-SEtLac(Ac)6           -            -            5468         139          -            -            -            -
-SEtLac(Ac)7 (-k)      2135         29           -            -            -            -            1516         22

a Kinetic constants determined in duplicate using the low substrate approximation at pH 8.6, 0.005% Tween 80, 1% DMSO with suc-AAPF-SBn as substrate. [S] = 15 or 30 µM, [E] = [illegible] x 10^-10 M.

b E/A = (kcat/KM)esterase / (kcat/KM)amidase.

c For SBL-WT: kcat/KM = 3592.5 mM^-1 s^-1, E/A = 17.
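
The E/A column is the quotient of the esterase kcat/KM in Table 4 and the amidase kcat/KM in Table 3. The following is a minimal illustrative sketch in Python, not part of the original disclosure. The function name is hypothetical, and the inputs are the tabulated L217C values.

```python
def esterase_to_amidase_ratio(esterase_kcat_km, amidase_kcat_km):
    """E/A = (kcat/KM)esterase / (kcat/KM)amidase, both in s^-1 mM^-1."""
    return esterase_kcat_km / amidase_kcat_km

# Unmodified L217C: esterase 5540 (Table 4), amidase 51 (Table 3)
ea_l217c = esterase_to_amidase_ratio(5540, 51)
# L217C-S-e: esterase 11923 (Table 4), amidase 73.3 (Table 3)
ea_l217c_e = esterase_to_amidase_ratio(11923, 73.3)

print(f"L217C E/A = {ea_l217c:.0f}")
print(f"L217C-S-e E/A = {ea_l217c_e:.0f}")
```

Both quotients round to the E/A values listed for these enzymes (109 and 163), compared with 17 for SBL-WT.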

The kinetic parameters of esterase activity were determined at pH 8.6 by indirectly
following the release of thiobenzyl alcohol from the substrate succinyl-Ala-Ala-Pro-Phe-
SBn (suc-AAPF-SBn) with Ellman's reagent (Ellman et al., Biochem. Pharmacol., 7:88-
95 (1961), which is hereby incorporated by reference). To allow a rapid screen of
esterase activity, a low substrate concentration ([S] << KM) was used that allowed kcat/KM
to be determined directly from the initial rate of reaction. The results from the screen are
shown in Table 4.
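
The low substrate approximation follows from the Michaelis-Menten equation: when [S] << KM, v = kcat[E][S]/(KM + [S]) reduces to v ≈ (kcat/KM)[E][S]. The following Python sketch, which is not part of the original disclosure, shows how much the approximation underestimates kcat/KM as [S] approaches KM. The kinetic constants and concentrations are hypothetical.

```python
def initial_rate(kcat, km, enzyme, substrate):
    """Michaelis-Menten initial rate; km and substrate share the same units."""
    return kcat * enzyme * substrate / (km + substrate)

def apparent_specificity(kcat, km, enzyme, substrate):
    """kcat/KM as recovered from a single rate via v / ([E][S])."""
    return initial_rate(kcat, km, enzyme, substrate) / (enzyme * substrate)

kcat, km = 100.0, 1.0  # hypothetical values: s^-1 and mM
true_value = kcat / km

for s in (0.015, 0.03, 0.3):  # mM, hypothetical substrate concentrations
    approx = apparent_specificity(kcat, km, enzyme=1e-6, substrate=s)
    print(f"[S] = {s} mM: apparent kcat/KM = {approx:.1f} (true {true_value:.1f})")

low = apparent_specificity(kcat, km, enzyme=1e-6, substrate=0.03)
```

The apparent value equals the true kcat/KM divided by (1 + [S]/KM), so the shortfall is about 3% when [S] is 3% of KM and grows as the substrate concentration rises.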

Modification at position 62, in the S2 pocket, with deprotected sugar
reagents 1b-f (See Figure 8) increased kcat/KM, resulting in a series of five enzymes that
had similar kcat/KM values that were 1.3- to 1.9-fold greater than WT (Figure 13A). The
presence of an α-linkage was clearly deleterious to activity, as N62C-SEtβGlc (-c) (See
Figure 8) had a kcat/KM 1.2-fold greater than its epimer N62C-SEtαGlc (-b) (See Figure 8)
and 1.9-fold greater than WT. Furthermore, the α-linked N62C-SEtαMan (-d) (See
Figure 8) had the lowest kcat/KM in this group, which was 1.3-fold greater than WT.

As at position 62, the introduction of any of the deprotected sugar side
chains b-f (See Figure 8) at position 217, in the S1' pocket, increased kcat/KM (Figure 13B).
However, the effects of glycosylation at this site were far more dramatic, as demonstrated
by a kcat/KM for L217C-SEtβGal (-e) (See Figure 8) that was 3.4-fold greater than WT. By
comparing the kcat/KM values of L217C-SβGlc and L217C-SEtβGlc (-c) (See Figure 8), it was
possible to gauge the effect on activity of introducing an ethyl tether at this position. This
introduction increased kcat/KM from 1.8-fold greater than WT for L217C-SβGlc to 2.7-
fold greater than WT for L217C-SEtβGlc (-c) (See Figure 8). As at position 62, β-linked
glyco-CMMs (-c,-e,-f) (See Figure 8) had higher kcat/KM values than the α-linked ones (-b,-d)
(See Figure 8). For example, L217C-SEtβGlc (-c) (See Figure 8) had a kcat/KM 1.3-fold
greater than L217C-SEtαGlc (-b) (See Figure 8).

Consistent with the surface exposed orientation of the S156 side chain, the
S156C deprotected glyco-CMMs had similar kcat/KM values that were 1.3- to 2.1-fold lower than
WT (Figure 13C).

At position 166, in the S1 pocket, mutation to cysteine resulted in an
enzyme with a severely lowered kcat/KM that was 10-fold lower than WT. However,
subsequent modification with 1b-f (See Figure 8) restored much of the catalytic activity
(Figure 13D), and the S166C deprotected glyco-CMMs S166C-S-b-f (See Figure 8) had
similar kcat/KM values that varied from 1.1- to 1.4-fold lower than WT.

Example 15 - Kinetic Effects of Glycosylation with Acetylated Carbohydrates.

At position 62, in the S2 pocket, in sharp contrast to the trend observed for
the unacetylated N62C CMMs, acetylated N62C CMMs had a wide range of kcat/KM values.
Introduction of acetates increased the kcat/KM of the α-linked CMMs relative to the
corresponding deprotected glyco-CMMs (Figure 14A). Thus, N62C-SEtαGlc(Ac)4 (-g)
(See Figure 8) and N62C-SEtαMan(Ac)4 (-i) (See Figure 8) had kcat/KM values 1.9- and 1.6-fold
greater than WT, respectively. Acetylation was clearly deleterious for β-linked CMMs, as
N62C-SEtβGal(Ac)3, N62C-SEtβGlc(Ac)2, and N62C-SEtβGlc(Ac)3 all had kcat/KM values
lower than WT. However, increasing the number of acetates present on the CMM
restored activity: N62C-SEtβGlc(Ac)3 had a kcat/KM only 1.3-fold lower than WT and 1.5-
fold higher than N62C-SEtβGlc(Ac)2. In spite of their size, the sterically bulky
lactosylated N62C CMMs, N62C-SEtLac (-f) (See Figure 8) and N62C-SEtLac(Ac)7 (-k)
(See Figure 8), had kcat/KM values that were similar to those of the CMMs derived from
monosaccharides. This provided a clear example of the versatility of the glycosylation
method illustrated in Figure 8 and demonstrated that by using this method it was possible
to introduce very large structures into the active site of SBL while maintaining catalytic
competency.

Modification with acetylated reagents 1a,g-k (See Figure 8) at position
217, in the S1' pocket, as with 1b-f (See Figure 8), led to CMMs with greater than WT
kcat/KM (Figure 14B). For the untethered glyco-CMMs, increasing the number of acetates
dramatically increased kcat/KM from 1.8-fold greater than WT for L217C-SβGlc to 2.4-
fold greater than WT for L217C-SβGlc(Ac)2 and to 3.2-fold greater than WT for L217C-
SβGlc(Ac)3, and mirrored the trend seen in amidase kinetics. For the ethyl linked L217C
glyco-CMMs, the effect of acetylation was dependent on the anomeric stereochemistry, as
observed for N62C glyco-CMMs and the L217C glyco-CMMs amidase kinetics.
Acetylation increased kcat/KM for α-linked CMMs but decreased kcat/KM for β-linked
CMMs. This was most pronounced for L217C-SEtβGal(Ac)3, which had a kcat/KM only
1.1-fold greater than WT and 3.1-fold lower than L217C-SEtβGal (-e). In contrast to the
effect on deprotected L217C glyco-CMMs, the activity of acetylated L217C glyco-CMMs
decreased upon the introduction of the ethyl linker. For example, L217C-SβGlc(Ac)3 had
a kcat/KM 3.2-fold greater than WT, as compared with the 1.4-fold greater than WT kcat/KM
of L217C-SEtβGlc(Ac)3.

At position 156, in the S1 pocket, the S156C acetylated glyco-CMMs
displayed little difference in their kinetic constants from their unacetylated counterparts
(Figure 14C), an observation that was consistent with the surface exposed nature of the
position 156 side chain. Introducing the ethyl linker led to an increase in kcat/KM from
2.6-fold lower than WT for S156C-SβGlc(Ac)4 (-a) (See Figure 8) to 1.5-fold lower than WT
for S156C-SEtβGlc(Ac)4 (-h) (See Figure 8).

In general, at position 166, in the S1 pocket, the effect of acetylation on
S166C ethyl linked glyco-CMMs was to reduce kcat/KM relative to their deprotected
counterparts (Figure 14D). The exceptions were the ethyl linked β-gluco-CMMs,
S166C-SEtβGlc(Ac)2 and S166C-SEtβGlc(Ac)3, which displayed kcat/KM values 1.4- and
1.7-fold greater than WT, respectively. These were the only two glyco-CMMs prepared at this site
to show an enhancement in kcat/KM relative to WT, and this example illustrates that the
correct selection of sugar is crucial to the tailoring of enzyme activity. In contrast to the
effects observed at positions 62 and 217, an α-linkage to the sugar moiety was deleterious
to the activity of the acetylated CMMs, and S166C-SEtαGlc(Ac)3 had a kcat/KM 1.9-fold
lower than WT, in direct contrast to S166C-SEtβGlc(Ac)3. Introduction of the sterically
bulky lactose moiety, in both acetylated and unacetylated forms, led to CMMs
S166C-SEtLac (-f) (See Figure 8) and S166C-SEtLac(Ac)5 with low kcat/KM values that were
1.4- and 2.3-fold lower than WT, respectively.

Example 16 - Full Esterase Kinetics. 

The three esterases with the highest kcat/KM values determined by the above
screen, L217C-SβGlc(Ac)3, L217C-SEtαMan(Ac)3, and L217C-SEtβGal (-e) (See Figure
8), had their individual kcat and KM values determined by the initial rates method. The results
are shown in Table 5.

Table 5.

Full Esterase Kinetic Data for Glycosylated CMMs (a)

Enzyme               kcat (s^-1)      KM (mM)       kcat/KM (mM^-1 s^-1)   xWT   E/A (b)

WT                   1940.0 ± 180     0.54 ± 0.07   3592.5 ± 572.7         1     17
L217C-SβGlc(Ac)3     4427.5 ± 100.9   0.15 ± 0.01   29516.7 ± 2079.0       8.4   293
L217C-SEtαMan(Ac)3   3827.0 ± 59.5    0.30 ± 0.01   12756.7 ± 469.2        3.6   77
L217C-SEtβGal        4398.5 ± 189.8   0.36 ± 0.04   12218.1 ± 1456.3       3.5   167

a Kinetic constants determined by method of initial rates in 0.1 M Tris buffer, pH 8.6,
0.005% Tween 80, 1% DMSO with suc-AAPF-SBn as substrate. [S] = 30 µM to 2 mM,
[E] = 9.6 × 10^-11 to 1.1 × 10^-10 M.
b E/A = (kcat/KM)esterase / (kcat/KM)amidase.



The results were in good agreement with those determined by the screen
for L217C-SEtαMan(Ac)3 and L217C-SEtβGal (-e) (See Figure 8) and confirmed the
activity of these two enzymes to be 3.6- and 3.5-fold higher than WT, respectively.
These increases in activity arose from both increased transition state stabilization, with
kcat values 2- and 2.3-fold greater than WT, respectively, and from greater substrate binding,
with KM values 1.8- and 1.5-fold lower than WT, respectively.
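
As a consistency check on Table 5, the specificity constant kcat/KM and the fold changes relative to WT can be recomputed from the tabulated kcat and KM values. The following is a minimal sketch only: the helper names and the ASCII spellings of the enzyme names are illustrative assumptions, and because the tabulated values are rounded, the recomputed ratios may differ slightly from the reported xWT column.

```python
# kcat (s^-1) and KM (mM) copied from Table 5.
# Keys are ASCII spellings of the enzyme names (an illustrative choice).
TABLE_5 = {
    "WT": (1940.0, 0.54),
    "L217C-S-beta-Glc(Ac)3": (4427.5, 0.15),
    "L217C-SEt-alpha-Man(Ac)3": (3827.0, 0.30),
    "L217C-SEt-beta-Gal": (4398.5, 0.36),
}


def specificity_constant(kcat, km):
    """Return kcat/KM in mM^-1 s^-1."""
    return kcat / km


def relative_to_wt(name, table=TABLE_5):
    """Return (kcat fold increase, KM fold decrease, kcat/KM fold increase) versus WT."""
    kcat_wt, km_wt = table["WT"]
    kcat, km = table[name]
    return (
        kcat / kcat_wt,
        km_wt / km,
        specificity_constant(kcat, km) / specificity_constant(kcat_wt, km_wt),
    )


for enzyme in TABLE_5:
    kcat_fold, km_fold, eff_fold = relative_to_wt(enzyme)
    print(f"{enzyme:26s} kcat x{kcat_fold:.2f}  KM /{km_fold:.2f}  kcat/KM x{eff_fold:.2f}")
```

The fold change in kcat/KM is simply the product of the fold increase in kcat and the fold decrease in KM, which is how the two contributions discussed above combine.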

Remarkable results were obtained for L217C-SβGlc(Ac)3. This enzyme
had a kcat 2.3-fold greater than WT and a KM 3.6-fold lower than WT, giving a kcat/KM
8.4-fold greater than WT and some 2.5-fold greater than the value estimated by the screen.
The difference in parameters obtained from the screen and the full kinetic analysis
exposed a limitation of the low substrate screen. For the low substrate approximation to
be accurate, the substrate concentration must be small compared to KM. The KM of
L217C-SβGlc(Ac)3 (0.15 mM) was evidently so small that the approximation did not hold
in this case. This was the largest enhancement of activity relative to WT achieved using
the combined site-directed mutagenesis and chemical modification strategy.
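
The limitation of the low substrate screen can be illustrated numerically. Under Michaelis-Menten kinetics the rate is v = kcat[E][S]/(KM + [S]); treating it as v = (kcat/KM)[E][S] recovers only the fraction KM/(KM + [S]) of the true kcat/KM, so an enzyme with a small KM is underestimated more strongly than one with a large KM. The sketch below assumes simple Michaelis-Menten behavior, and the substrate concentration in it is a hypothetical value chosen for illustration, not the concentration used in the screen.

```python
def mm_rate(kcat, km, enzyme, substrate):
    """Michaelis-Menten initial rate, v = kcat[E][S] / (KM + [S])."""
    return kcat * enzyme * substrate / (km + substrate)


def screen_estimate(kcat, km, substrate, enzyme=1.0):
    """kcat/KM inferred from a single rate, assuming v = (kcat/KM)[E][S]."""
    return mm_rate(kcat, km, enzyme, substrate) / (enzyme * substrate)


def fraction_recovered(km, substrate):
    """Screen estimate divided by the true kcat/KM; equals KM / (KM + [S])."""
    return km / (km + substrate)


# Hypothetical substrate concentration in mM (an assumption for illustration only).
S_HYPOTHETICAL = 0.1

# kcat (s^-1) and KM (mM) taken from Table 5.
for name, kcat, km in [("WT", 1940.0, 0.54), ("L217C-S-beta-Glc(Ac)3", 4427.5, 0.15)]:
    estimate = screen_estimate(kcat, km, S_HYPOTHETICAL)
    recovered = fraction_recovered(km, S_HYPOTHETICAL)
    print(f"{name:24s} estimate {estimate:10.1f}  fraction of true value {recovered:.2f}")
```

At any fixed [S], the enzyme with the smaller KM retains a smaller fraction of its true kcat/KM, so its enhancement relative to WT is understated by the screen.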

Example 17 - Esterase Activity versus Amidase Activity. 

The differing effects of glycosylation upon amidase and esterase kcat/KM
can be compared in an informative manner using the (kcat/KM)esterase / (kcat/KM)amidase ratio,
E/A (see Figure 15).

All N62C deprotected glyco-CMMs had E/As that were enhanced relative
to WT. The increase in E/A was dependent on the presence of either an α- or a β-linkage,
with the β-linked CMMs N62C-SEtβGlc (-c) (See Figure 8), -SEtβGal (-e) (See Figure
8), and -SEtLac (-f) (See Figure 8) having higher E/As than the α-linked CMMs
N62C-SEtαGlc (-b) (See Figure 8) and -SEtαMan (-d) (See Figure 8) (Figure 15A). These
increased ratios were due to both increases in esterase kcat/KM and reductions in amidase
kcat/KM.
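
The E/A ratio defined above, and its expression as a fold change relative to WT, can be sketched as follows. The E/A values are those listed in Table 5; the function names and the ASCII enzyme spellings are illustrative assumptions. With these values, 293/17 is approximately 17.2, which matches the 17.2-fold enhancement quoted for L217C-SβGlc(Ac)3 in Example 18.

```python
def e_over_a(esterase_kcat_km, amidase_kcat_km):
    """E/A = (kcat/KM) for ester hydrolysis divided by (kcat/KM) for amide hydrolysis."""
    return esterase_kcat_km / amidase_kcat_km


# E/A values as listed in Table 5 (keys are ASCII spellings of the enzyme names).
E_OVER_A = {
    "WT": 17,
    "L217C-S-beta-Glc(Ac)3": 293,
    "L217C-SEt-alpha-Man(Ac)3": 77,
    "L217C-SEt-beta-Gal": 167,
}


def fold_vs_wt(name, table=E_OVER_A):
    """Return the E/A of the named enzyme relative to the WT E/A."""
    return table[name] / table["WT"]


for enzyme in E_OVER_A:
    print(f"{enzyme:26s} E/A = {E_OVER_A[enzyme]:4d}  ({fold_vs_wt(enzyme):.1f} x WT)")
```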

As observed for the modifications made at position 62, glycosylation at
position 217, in the S1' pocket, led to enzymes that had greatly increased E/A ratios
relative to WT. Mutation to cysteine at position 217 increased E/A to 6.4-fold greater
than WT. Modification with unacetylated β-linked sugars -SβGlc, -SEtβGlc (-c) (See
Figure 8), -SEtβGal (-e) (See Figure 8), and -SEtLac (-f) (See Figure 8) further increased E/A
(Figure 15B). In contrast, α-linked glyco-CMMs had lower E/A values than that of the
mutant. These changes in E/As were a result of parallel changes in esterase kcat/KM that
were further amplified by opposing changes in amidase kcat/KM. Introduction of an ethyl
linker reduced the E/A from 10.9-fold greater than WT for L217C-SβGlc to 7.6-fold
greater than WT for L217C-SEtβGlc (-c) (See Figure 8).

At position 156, in the S1 pocket, E/A ratios for unacetylated glyco-CMMs
were all similar to each other in the range 1.2- to 1.6-fold greater than WT (Figure 15C).

The mutant S166C had an exceptionally low E/A that was 4.2-fold lower
than WT (Figure 15D), and this was largely a result of its very low esterase kcat/KM.
Because modification of S166C restored esterase kcat/KM to levels approaching that of
WT and because S166C glyco-CMMs were poor amidases relative to WT, the net result
was a family of CMMs with similar E/A ratios that were all enhanced relative to WT and
significantly higher than the cysteine mutant (Figure 15D).

Example 18 - Effects of Glycosylation with Acetylated Carbohydrates on E/A. 

At position 62, in the S2 pocket, with the exception of N62C-SEtβGal(Ac)3
and N62C-SEtαMan(Ac)4 (-i) (See Figure 8), acetylation of N62C glyco-CMMs led to a
reduction in E/A. However, like their deprotected counterparts, the acetylated N62C
glyco-CMMs all had larger E/As than WT (Figure 16A). Increasing the level of
acetylation increased the E/A ratio from 2.1-fold greater than WT for
N62C-SEtβGlc(Ac)2 to 3.0-fold greater than WT for N62C-SEtβGlc(Ac)3. In spite of the
general increases in E/A observed for the ethyl linked glyco-CMMs, the untethered CMM
N62C-SβGlc(Ac)4 (-a) (See Figure 8) had an E/A very similar to WT.

At position 217, in the S1' pocket, L217C acetylated glyco-CMMs all had
enhanced E/As relative to WT (Figure 16B). As for their deprotected counterparts,
β-linked acetylated glyco-CMMs had higher E/As than that of the L217C mutant, whereas
the E/As of the α-linked acetylated glyco-CMMs were lower than that of the mutant. In
fact, modification at this site produced CMMs with by far the greatest enhancement of
this ratio, and the E/A for L217C-SβGlc(Ac)3 was 17.2-fold greater than WT. In contrast
to the L217C deprotected glyco-CMMs, introducing an ethyl linker lowered E/A from
17.2-fold greater than WT for L217C-SβGlc(Ac)3 to 9.2-fold greater than WT for
L217C-SEtβGlc(Ac)3. For the untethered L217C acetylated glyco-CMMs, increasing the number
of acetates also increased E/A, from 5.0-fold greater than WT for L217C-SβGlc(Ac)2 to
17.2-fold greater than WT for L217C-SβGlc(Ac)3.

At position 156, in the S1 pocket, acetylation of α-linked glyco-CMMs
increased E/As (Figure 16C). Hence, S156C-SEtβGlc(Ac)4 (-h) (See Figure 8) had an
E/A that was 1.5-fold greater than WT, whereas S156C-SEtαGlc(Ac)4 (-g) (See Figure 8)
had an E/A 2.0-fold greater than WT. S156C-SEtαMan(Ac)4 (-i) (See Figure 8) also had
an E/A 1.3-fold greater than that of its unacetylated counterpart, S156C-SEtαMan (-d)
(See Figure 8).

At position 166, in the S1 pocket, acetylation caused an increase in E/A for
the glucosylated CMMs, irrespective of the anomeric stereochemistry (Figure 16D).
Acetylation of all other sugar moieties led to a reduction in E/A. Increasing the number
of acetates increased E/A from 2.7-fold greater than WT for S166C-SEtβGlc(Ac)2 to
3.2-fold greater than WT for S166C-SEtβGlc(Ac)3.

Example 19 - Molecular Modeling 

The X-ray structure of subtilisin Bacillus lentus with the peptide inhibitor
AAPF bound (Brookhaven database entry 1JEA) was used as the starting point for
calculations on wild type and CMMs. The enzyme setup was performed with Insight II,
version 2.3.0 (Biosym Technologies, Inc., San Diego, CA). To create initial coordinates
for the minimization, hydrogens were added at the pH 8.6 used for kinetic measurements.
This protonated all Lys and Arg residues and the N-terminus and deprotonated all Glu and
Asp residues and the C-terminal carboxyl group. The protonated form of His 64 was used
in all calculations. The model system was solvated with a 5 Å layer of water molecules.
The total number of water molecules in the system was 1143. The overall charge of the
enzyme-inhibitor complex resulting from this setup was +4 for the WT enzyme. Energy
simulations were performed with the DISCOVER program, Version 2.9.5 (Biosym
Technologies, Inc., San Diego, CA) on a Silicon Graphics Indigo computer, using the
consistent valence force field (CVFF) function. A non-bonded cutoff distance of 18 Å
with a switching distance of 2 Å was employed. The non-bonded pair list was updated
every 20 cycles and a dielectric constant of 1 was used in all calculations. The WT
enzyme was minimized in stages, with initially only the water molecules being allowed to
move, followed by water molecules and the amino acid side chains, and then finally the
entire enzyme. The mutated and chemically modified enzymes were generated by
modifying the relevant amino acid using the Builder module of Insight. These structures
were then minimized in a similar manner. Initially, the side-chain of the mutated residue
and the water molecules were minimized. Then, all side-chains and the water molecules
were minimized while the backbones of the residues were constrained, then all of the
atoms were minimized. The AAPF inhibitor was free to move throughout all stages of
the minimization. Each stage of energy minimization was conducted by means of the
method of steepest descents without Morse or cross terms until the derivative of energy
with respect to structural perturbation was less than 5.0 kcal/Å; then, the method of
conjugate gradients, without Morse or cross terms, until the derivative of energy with
respect to structural perturbation was less than 1.0 kcal/Å; and, finally, the method of
conjugate gradients, with Morse and cross terms, until the final derivative of energy with
respect to structural perturbation was less than 0.1 kcal/Å.
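
The staged convergence scheme described above (steepest descents to a coarse gradient threshold, then conjugate gradients to progressively tighter thresholds) can be sketched on a toy problem. This is not the DISCOVER/CVFF calculation: the "energy" below is a hypothetical set of independent harmonic wells, the Euclidean norm of the gradient is assumed as the convergence measure, an exact line search valid only for quadratic energies is used, and the switch between force-field terms (without or with Morse and cross terms) is not modeled. Only the thresholds 5.0, 1.0, and 0.1 are taken from the text.

```python
import math

# Toy "force field": independent harmonic wells, E = 1/2 * sum(k_i * x_i**2).
# The stiffness values are hypothetical and chosen only to make the problem ill-conditioned.
STIFFNESS = [1.0, 4.0, 9.0, 16.0, 25.0, 36.0]


def gradient(x):
    """Gradient of the toy energy with respect to each coordinate."""
    return [k * xi for k, xi in zip(STIFFNESS, x)]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def grad_norm(x):
    g = gradient(x)
    return math.sqrt(dot(g, g))


def exact_step(g, d):
    """Step length that minimizes the quadratic toy energy along direction d."""
    curvature = sum(k * di * di for k, di in zip(STIFFNESS, d))
    return -dot(g, d) / curvature


def steepest_descent(x, tol, max_iter=10000):
    """Move along the negative gradient until the gradient norm drops below tol."""
    for _ in range(max_iter):
        g = gradient(x)
        if math.sqrt(dot(g, g)) < tol:
            break
        d = [-gi for gi in g]
        step = exact_step(g, d)
        x = [xi + step * di for xi, di in zip(x, d)]
    return x


def conjugate_gradient(x, tol, max_iter=1000):
    """Fletcher-Reeves conjugate gradients until the gradient norm drops below tol."""
    g = gradient(x)
    d = [-gi for gi in g]
    for _ in range(max_iter):
        if math.sqrt(dot(g, g)) < tol:
            break
        step = exact_step(g, d)
        x = [xi + step * di for xi, di in zip(x, d)]
        g_new = gradient(x)
        beta = dot(g_new, g_new) / dot(g, g)
        d = [-gn + beta * di for gn, di in zip(g_new, d)]
        g = g_new
    return x


def staged_minimize(x):
    """Apply the three convergence stages in order; return the structure after
    the steepest descent stage and the final structure."""
    x = steepest_descent(x, tol=5.0)
    after_sd = x
    x = conjugate_gradient(x, tol=1.0)
    x = conjugate_gradient(x, tol=0.1)
    return after_sd, x


START = [10.0, -8.0, 6.0, -4.0, 2.0, 1.0]
after_sd, final = staged_minimize(START)
print("gradient norm after steepest descent:", grad_norm(after_sd))
print("gradient norm after final stage:", grad_norm(final))
```

Steepest descent is robust far from a minimum but converges slowly, while conjugate gradients converges faster near a minimum, which is the usual motivation for ordering the stages this way.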

The molecular basis for the vastly improved esterase activities observed
was analyzed by molecular modeling of the peptidyl product inhibitor AAPF bound to the
SBL-CMM L217C-SβGlc(Ac)3. While the substrate employed for kinetic analysis is
succinylated, this moiety was not included in molecular modeling since its orientation
was not reported in the X-ray structure of SBL, suggesting high mobility in the crystal.

As shown in Figures 17 and 18, molecular modeling analysis of
L217C-SβGlc(Ac)3 revealed that the observed kcat/KM changes correlate with the occupation of the
S1' pocket of SBL by the glucosylated -SβGlc(Ac)3 side chain. In the minimized structure
(Figure 17), the position of the glucose moiety was fixed by a network of
hydrogen-bonding interactions (shown as white dotted lines) between the oxygen atoms on the C-3,
4, and 6 substituents of glucose and water molecules in the surrounding external solvent.
This extensive solvation directed the C-2 substituent of glucose internally towards the
catalytic triad. In this orientation, the carbonyl oxygen atom of the C-2 acetate group acts
as a hydrogen bond (1.89 Å) acceptor and stabilizes water molecule 127 in close proximity
to the carboxy terminus of AAPF, used here as a substrate analog for modeling; this water
molecule is shown in Figure 17 hydrogen-bonding to the O atom of the carboxylic acid.

These results suggested that, firstly, the low amidase activity of
L217C-SβGlc(Ac)3, which was 2-fold lower than WT, was a result of the S1' pocket being occupied
by the glucose moiety at position 217. This prevents efficient binding of the pNA leaving
group and, therefore, decreases the rate of acyl-enzyme intermediate formation, which is
the rate-determining step for amidase activity. Secondly, after the pNA has been
displaced to form the covalent Acyl-Ser221 intermediate, the glucose moiety stabilizes a
crucial, nucleophilic water molecule (Wat 127) in close proximity to the carbonyl carbon
atom, through a hydrogen-bond to the oxygen of the C-2 acetate of glucose, as illustrated
in Figure 18. This facilitates hydrolysis of the acyl-enzyme intermediate and therefore
increases the rate of deacylation, which is the rate limiting step for esterase activity
(Zerner et al., J. Am. Chem. Soc., 86:3674-3679 (1964); Whitaker et al., J. Am. Chem.
Soc., 87:2728-2737 (1965); Berezin et al., FEBS Lett., 15:121-124 (1971), which are
hereby incorporated by reference).

Glycosylation of SBL at sites within the active site can dramatically
enhance its esterase activity. The library of glycosylated CMMs synthesized using the
combined site directed mutagenesis and chemical modification strategy contains 22
CMMs with greater than WT activity. Glycosylation at positions 62, in the S2 pocket, and
217, in the S1' pocket, gave the greatest increases in kcat/KM. The most active CMM,
L217C-SβGlc(Ac)3, had a kcat/KM that was 8.4-fold greater than WT and was the most
active esterase synthesized using this approach. When surface exposed position 156 was
glycosylated, there was little alteration in activity, and this demonstrates that the
introduction of sugars at such sites has little effect on the catalytic activity of SBL.

In addition to the tailoring of esterase kcat/KM values, glycosylation also led
to enormous improvements in specificity for ester versus amide hydrolysis, as determined
by measurement of the ratio (kcat/KM)esterase / (kcat/KM)amidase, E/A. This ratio has been
increased to 17.2-fold greater than WT for L217C-SβGlc(Ac)3. Such enzymes are very
attractive candidates for use in peptide synthesis, where a high esterase to amidase ratio is
desirable. Furthermore, the CMMs described above have an even greater potential for
this purpose, as the increases in E/As have been achieved by an increase in the catalytic
efficiency of these enzymes towards ester substrates, in addition to a reduction in the
amidase activity.

Although the invention has been described in detail for the purpose of 
illustration, it is understood that such detail is solely for that purpose, and variations can 
be made therein by those skilled in the art without departing from the spirit and scope of 
the invention which is defined by the following claims.

WHAT IS CLAIMED:

1. A chemically modified mutant protein, said mutant protein
comprising a cysteine residue substituted for a residue other than cysteine in a precursor
protein, the substituted cysteine residue being subsequently modified by reacting said
cysteine residue with a glycosylated thiosulfonate.

2. A chemically modified mutant protein according to claim 1,
wherein the protein is an enzyme.

3. A chemically modified mutant protein according to claim 2,
wherein the enzyme is a protease.

4. A chemically modified mutant protein according to claim 3,
wherein the protease is a Bacillus lentus subtilisin.

5. A chemically modified mutant protein according to claim 1,
wherein said thiosulfonate comprises an alkylthiosulfonate.

6. A chemically modified mutant protein according to claim 5,
wherein said alkylthiosulfonate comprises methanethiosulfonate.

7. A chemically modified mutant protein according to claim 1,
wherein the residue other than cysteine is an amino acid selected from the group
consisting of asparagine, leucine, and serine.

8. A chemically modified mutant protein according to claim 1,
wherein the residue other than cysteine is in a substrate binding subsite of the protein.

9. A chemically modified mutant protein according to claim 1,
wherein the glycosylated thiosulfonate comprises a thiol side chain comprising -S-β-Glc,
-S-Et-β-Gal, -S-Et-β-Glc, -S-Et-α-Glc, -S-Et-α-Man, -S-Et-Lac, -S-β-Glc(Ac)2,
-S-β-Glc(Ac)3, -S-β-Glc(Ac)4, -S-Et-α-Glc(Ac)2, -S-Et-α-Glc(Ac)3, -S-Et-α-Glc(Ac)4,
-S-Et-β-Glc(Ac)2, -S-Et-β-Glc(Ac)3, -S-Et-β-Glc(Ac)4, -S-Et-α-Man(Ac)3,
-S-Et-α-Man(Ac)4, -S-Et-β-Gal(Ac)3, -S-Et-β-Gal(Ac)4, -S-Et-Lac(Ac)5, -S-Et-Lac(Ac)6,
or -S-Et-Lac(Ac)7.

10. A chemically modified mutant protein according to claim 1,
wherein the carbohydrate moiety is a dendrimer moiety.

11. A method of producing a chemically modified mutant protein 
comprising the steps of: (a) providing a precursor protein; (b) substituting an amino acid 
residue other than cysteine in said precursor protein with a cysteine; (c) reacting said 
substituted cysteine with a glycosylated thiosulfonate, said glycosylated thiosulfonate 
comprising a carbohydrate moiety; and (d) obtaining a modified glycosylated protein 
wherein said substituted cysteine comprises a carbohydrate moiety attached thereto. 

12. A method according to claim 11, wherein said thiosulfonate
comprises an alkylthiosulfonate.

13. A method according to claim 12, wherein said alkylthiosulfonate
comprises a methanethiosulfonate.

14. A method according to claim 11, wherein the protein is an enzyme.

15. A method according to claim 14, wherein the enzyme is a protease.

16. A method according to claim 15, wherein the protease is a Bacillus
lentus subtilisin.

17. A method according to claim 11, wherein the amino acid residue
other than cysteine is an amino acid selected from the group consisting of asparagine,
leucine, and serine.

18. A method according to claim 11, wherein the amino acid residue
other than cysteine is in a substrate binding subsite of the protein.

19. A method according to claim 11, wherein the glycosylated
thiosulfonate comprises a thiol side chain comprising -S-β-Glc, -S-Et-β-Gal, -S-Et-β-Glc,
-S-Et-α-Glc, -S-Et-α-Man, -S-Et-Lac, -S-β-Glc(Ac)2, -S-β-Glc(Ac)3, -S-β-Glc(Ac)4,
-S-Et-α-Glc(Ac)2, -S-Et-α-Glc(Ac)3, -S-Et-α-Glc(Ac)4, -S-Et-β-Glc(Ac)2,
-S-Et-β-Glc(Ac)3, -S-Et-β-Glc(Ac)4, -S-Et-α-Man(Ac)3, -S-Et-α-Man(Ac)4,
-S-Et-β-Gal(Ac)3, -S-Et-β-Gal(Ac)4, -S-Et-Lac(Ac)5, -S-Et-Lac(Ac)6, or -S-Et-Lac(Ac)7.

20. A method according to claim 11, wherein the carbohydrate moiety
is a dendrimer moiety.

21. A glycosylated thiosulfonate comprising:

        O
        ||
    H3C-S-SR
        ||
        O

wherein R comprises -β-Glc, -Et-β-Gal, -Et-β-Glc, -Et-α-Glc, -Et-α-Man, -Et-Lac,
-β-Glc(Ac)2, -β-Glc(Ac)3, -β-Glc(Ac)4, -Et-α-Glc(Ac)2, -Et-α-Glc(Ac)3, -Et-α-Glc(Ac)4,
-Et-β-Glc(Ac)2, -Et-β-Glc(Ac)3, -Et-β-Glc(Ac)4, -Et-α-Man(Ac)3, -Et-α-Man(Ac)4,
-Et-β-Gal(Ac)3, -Et-β-Gal(Ac)4, -Et-Lac(Ac)5, -Et-Lac(Ac)6, or -Et-Lac(Ac)7.

22. A method of modifying the functional characteristics of a protein
comprising:

providing a protein and

reacting the protein with a glycosylated thiosulfonate reagent under
conditions effective to produce a glycoprotein with altered functional characteristics as
compared to the protein.

23. A method according to claim 22, wherein the protein is an enzyme.

24. A method according to claim 23, wherein the enzyme is a protease.

25. A method according to claim 24, wherein the protease is a Bacillus
lentus subtilisin.

26. A method according to claim 22, wherein the glycosylated
thiosulfonate comprises:

        O
        ||
    H3C-S-SR
        ||
        O

wherein R comprises -β-Glc, -Et-β-Gal, -Et-β-Glc, -Et-α-Glc, -Et-α-Man, -Et-Lac,
-β-Glc(Ac)2, -β-Glc(Ac)3, -β-Glc(Ac)4, -Et-α-Glc(Ac)2, -Et-α-Glc(Ac)3, -Et-α-Glc(Ac)4,
-Et-β-Glc(Ac)2, -Et-β-Glc(Ac)3, -Et-β-Glc(Ac)4, -Et-α-Man(Ac)3, -Et-α-Man(Ac)4,
-Et-β-Gal(Ac)3, -Et-β-Gal(Ac)4, -Et-Lac(Ac)5, -Et-Lac(Ac)6, or -Et-Lac(Ac)7.

27. A method of determining the structure-function relationships of
chemically modified mutant proteins comprising:

providing first and second chemically modified mutant proteins according
to claim 1, wherein the glycosylation pattern of the second chemically modified mutant
protein is different from the glycosylation pattern of the first chemically modified mutant
protein;

evaluating a functional characteristic of the first and second chemically
modified mutant proteins; and

correlating the functional characteristic of the first and second chemically
modified mutant proteins with the structures of the first and second chemically modified
mutant proteins.

28. A method according to claim 27, wherein the protein is an enzyme.

29. A method according to claim 28, wherein the enzyme is a protease.

30. A method according to claim 29, wherein the protease is a Bacillus
lentus subtilisin.

31. A method according to claim 27, wherein the residue other than
cysteine is an amino acid selected from the group consisting of asparagine, leucine, and
serine.

32. A method according to claim 27, wherein the residue other than
cysteine is in a substrate binding subsite of the protein.

33. A method according to claim 27, wherein the glycosylated
thiosulfonate comprises a thiol side chain comprising -S-β-Glc, -S-Et-β-Gal, -S-Et-β-Glc,
-S-Et-α-Glc, -S-Et-α-Man, -S-Et-Lac, -S-β-Glc(Ac)2, -S-β-Glc(Ac)3, -S-β-Glc(Ac)4,
-S-Et-α-Glc(Ac)2, -S-Et-α-Glc(Ac)3, -S-Et-α-Glc(Ac)4, -S-Et-β-Glc(Ac)2,
-S-Et-β-Glc(Ac)3, -S-Et-β-Glc(Ac)4, -S-Et-α-Man(Ac)3, -S-Et-α-Man(Ac)4,
-S-Et-β-Gal(Ac)3, -S-Et-β-Gal(Ac)4, -S-Et-Lac(Ac)5, -S-Et-Lac(Ac)6, or -S-Et-Lac(Ac)7.

34. A method according to claim 27, wherein the carbohydrate moiety
is a dendrimer moiety.

35. A method of determining the structure-function relationships of
chemically modified mutant proteins comprising:

providing first and second chemically modified mutant proteins according
to claim 1, wherein at least one different cysteine residue in the second chemically
modified mutant enzyme is modified by reacting said cysteine residue with a glycosylated
thiosulfonate;

evaluating a functional characteristic of the first and second chemically
modified mutant proteins; and

correlating the functional characteristic of the first and second chemically
modified mutant proteins with the structures of the first and second chemically modified
mutant proteins.

36. A method according to claim 35, wherein the protein is an enzyme.

37. A method according to claim 36, wherein the enzyme is a protease.

38. A method according to claim 37, wherein the protease is a Bacillus
lentus subtilisin.

39. A method according to claim 35, wherein the residue other than
cysteine is an amino acid selected from the group consisting of asparagine, leucine, and
serine.

40. A method according to claim 35, wherein the residue other than
cysteine is in a substrate binding subsite of the protein.

41. A method according to claim 35, wherein the glycosylated
thiosulfonate comprises a thiol side chain comprising -S-β-Glc, -S-Et-β-Gal, -S-Et-β-Glc,
-S-Et-α-Glc, -S-Et-α-Man, -S-Et-Lac, -S-β-Glc(Ac)2, -S-β-Glc(Ac)3, -S-β-Glc(Ac)4,
-S-Et-α-Glc(Ac)2, -S-Et-α-Glc(Ac)3, -S-Et-α-Glc(Ac)4, -S-Et-β-Glc(Ac)2,
-S-Et-β-Glc(Ac)3, -S-Et-β-Glc(Ac)4, -S-Et-α-Man(Ac)3, -S-Et-α-Man(Ac)4,
-S-Et-β-Gal(Ac)3, -S-Et-β-Gal(Ac)4, -S-Et-Lac(Ac)5, -S-Et-Lac(Ac)6, or -S-Et-Lac(Ac)7.

42. A method according to claim 35, wherein the carbohydrate moiety
is a dendrimer moiety.

FIGURE 3

CHEMICALLY MODIFIED PROTEINS WITH A CARBOHYDRATE MOIETY
CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Patent Application
Serial No. 60/091,687, filed July 2, 1998, and U.S. Provisional Patent Application Serial
No. 60/131,446, filed April 28, 1999, which are hereby incorporated by reference.

FIELD OF THE INVENTION

The present invention relates to chemically modified mutant proteins
having modified glycosylation patterns with respect to a precursor protein from which
they are derived. In particular, the present invention relates to a chemically modified
mutant protein including a cysteine residue substituted for a residue other than cysteine in
a precursor protein, the substituted cysteine residue being subsequently modified by
reacting the cysteine residue with a glycosylated thiosulfonate. The present invention
also relates to a method of producing the chemically modified mutant proteins and a
glycosylated methanethiosulfonate. Another aspect of the present invention is a method
of modifying the functional characteristics of a protein by reacting the protein with a
glycosylated methanethiosulfonate reagent. The present invention also relates to methods
of determining the structure-function relationships of chemically modified mutant
proteins.

BACKGROUND OF THE INVENTION

Modifying enzyme properties by site-directed mutagenesis has been
limited to natural amino acid replacements, although molecular biological strategies for
overcoming this restriction have recently been derived (Cornish et al., Angew. Chem., Int.
Ed. Engl., 34:621-633 (1995)). However, the latter procedures are difficult to apply in
most laboratories. In contrast, controlled chemical modification of enzymes offers broad
potential for facile and flexible modification of enzyme structure, thereby opening up
extensive possibilities for controlled tailoring of enzyme specificity.

Changing enzyme properties by chemical modification has been explored
previously, with the first report being in 1966 by the groups of Bender (Polgar et al., J.
Am. Chem. Soc., 88:3153-3154 (1966)) and Koshland (Neet et al., Proc. Natl. Acad. Sci.
USA, 56:1606-1611 (1966)), who created a thiolsubtilisin by chemical transformation
(CH2OH → CH2SH) of the active site serine residue of subtilisin BPN' to cysteine.
Interest in chemically produced artificial enzymes, including some with synthetic
potential, was renewed by Wu (Wu et al., J. Am. Chem. Soc., 111:4514-4515 (1989); Bell
et al., Biochemistry, 32:3754-3762 (1993)) and Peterson (Peterson et al., Biochemistry,
34:6616-6620 (1995)), and, more recently, Suckling (Suckling et al., Bioorg. Med. Chem.
Lett., 3:531-534 (1993)).

Enzymes are now widely accepted as useful catalysts in organic synthesis.
However, natural, wild-type, enzymes can never hope to accept all structures of synthetic
chemical interest, nor always be transformed stereospecifically into the desired
enantiomerically pure materials needed for synthesis. This potential limitation on the
synthetic applicabilities of enzymes has been recognized, and some progress has been
made in altering their specificities in a controlled manner using the site-directed and
random mutagenesis techniques of protein engineering. However, modifying enzyme
properties by protein engineering is limited to making natural amino acid replacements,
and molecular biological methods devised to overcome this restriction are not readily
amenable to routine application or large scale synthesis. The generation of new
specificities or activities obtained by chemical modification of enzymes has intrigued
chemists for many years and continues to do so.

U.S. Patent No. 5,208,158 to Bech et al. ("Bech") describes chemically 
modified detergent enzymes where one or more methionines have been mutated into 
cysteines. The cysteines are subsequently modified in order to confer upon the enzyme
improved stability towards oxidative agents. The claimed chemical modification is the 
replacement of the thiol hydrogen with C^alkyl. 

Although Bech has described altering the oxidative stability of an enzyme 
through mutagenesis and chemical modification, it would also be desirable to develop one 
30 or more enzymes with altered properties such as activity, nucleophile specificity, 

substrate specificity, stereoselectivity, thermal stability, pH activity profile, and surface 
binding properties for use in, for example, detergents or organic synthesis. In particular,
enzymes, such as subtilisins, tailored for peptide synthesis would be desirable. Enzymes
useful for peptide synthesis have high esterase and low amidase activities. Generally,
subtilisins do not meet these requirements, and improvement of the esterase-to-amidase
selectivities of subtilisins would be desirable. However, previous attempts to tailor
enzymes for peptide synthesis by lowering amidase activity have generally resulted in
dramatic decreases in both esterase and amidase activities. Previous strategies for
lowering the amidase activity include the use of water-miscible organic solvents (Barbas
et al., J. Am. Chem. Soc., 110:5162-5166 (1988); Wong et al., J. Am. Chem. Soc.,
112:945-953 (1990); and Sears et al., Biotechnol. Prog., 12:423-433 (1996)) and site-
directed mutagenesis (Abrahamsen et al., Biochemistry, 30:4151-4159 (1991); Bonneau
et al., "Alteration of the Specificity of Subtilisin BPN' by Site-Directed Mutagenesis in its
S1 and S1' Binding-Sites," J. Am. Chem. Soc., 113:1026-1030 (1991); and Graycar et al.,
Ann. N.Y. Acad. Sci., 67:71-79 (1992)). However, while the ratios of esterase-to-
amidase activities were improved by these approaches, the absolute esterase activities
were lowered concomitantly (Abrahamsen et al., Biochemistry, 30:4151-4159 (1991)).
Chemical modification techniques (Neet et al., Proc. Nat. Acad. Sci., 56:1606 (1966);
Polgar et al., J. Am. Chem. Soc., 88:3153-3154 (1966); Wu et al., J. Am. Chem. Soc.,
111:4514-4515 (1989); and West et al., J. Am. Chem. Soc., 112:5313-5320 (1990)),
which permit the incorporation of unnatural amino acid moieties, have also been applied
to improve esterase-to-amidase selectivity of subtilisins. For example, chemical
conversion of the catalytic triad serine (Ser221) of subtilisin to cysteine (Neet et al., Proc.
Nat. Acad. Sci., 56:1606 (1966); Polgar et al., J. Am. Chem. Soc., 88:3153-3154 (1966);
and Nakatsuka et al., J. Am. Chem. Soc., 109:3808-3810 (1987)) or to selenocysteine
(Wu et al., J. Am. Chem. Soc., 111:4514-4515 (1989)), and methylation of the catalytic
triad histidine (His57) of chymotrypsin (West et al., J. Am. Chem. Soc., 112:5313-5320
(1990)), effected substantial improvement in esterase-to-amidase selectivities.
Unfortunately, however, these modifications were again accompanied by 50- to 1000-fold
decreases in absolute esterase activity.
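
The distinction drawn above, between an improved esterase-to-amidase ratio and a lowered absolute esterase activity, can be illustrated with a short calculation. The following is a minimal sketch only: the `Kinetics` class, the `fold_change` helper, and all numerical values are hypothetical and are not taken from the cited studies.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Kinetics:
    """Specificity constants (kcat/KM) for one enzyme, in any consistent unit."""
    esterase: float
    amidase: float

    @property
    def e_over_a(self) -> float:
        """Esterase-to-amidase selectivity (E/A)."""
        return self.esterase / self.amidase


def fold_change(value: float, reference: float) -> float:
    """Return value / reference, e.g. a variant relative to wild type."""
    return value / reference


# Hypothetical values chosen only to illustrate the arithmetic.
wild_type = Kinetics(esterase=1000.0, amidase=100.0)
variant = Kinetics(esterase=20.0, amidase=0.5)

# The selectivity ratio improves even though absolute esterase activity falls.
selectivity_gain = fold_change(variant.e_over_a, wild_type.e_over_a)
esterase_loss = fold_change(wild_type.esterase, variant.esterase)

print(f"E/A improves {selectivity_gain:.1f}-fold")
print(f"absolute esterase activity falls {esterase_loss:.0f}-fold")
```

In this illustration the variant is more selective than the wild type, yet far less active as an esterase, which is the shortcoming of the earlier approaches described above.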

Surface glycoproteins act as markers in cell-cell communication events
that determine microbial virulence (Sharon et al., Essays Biochem., 30:59-75 (1995)),
inflammation (Lasky, Annu. Rev. Biochem., 64:113-139 (1995); Weis et al., Annu. Rev.
Biochem., 65:441-473 (1996)), and host immune responses (Varki, Glycobiol., 3:97-130
(1993); Dwek, Chem. Rev., 96:683-720 (1996)). In addition, the correct glycosylation of
proteins is critical to their expression and folding (Helenius, Mol. Biol. Cell, 5:253-265
(1994)) and increases their thermal and proteolytic stability (Opdenakker et al., FASEB J.,
7:1330-1337 (1993)). Glycoproteins occur naturally in a number of forms (glycoforms)
(Rademacher et al., Annu. Rev. Biochem., 57:785-838 (1988)) that possess the same
peptide backbone, but differ in both the nature and site of glycosylation. The differences
exhibited (Rademacher et al., Annu. Rev. Biochem., 57:785-838 (1988); Parekh et al.,
Biochem., 28:7670-7679 (1989); Knight, Biotechnol., 7:35-40 (1989)) by each
component within these microheterogeneous mixtures present regulatory difficulties (Liu,
Trends Biotechnol., 10:114-120 (1992); Bill et al., Chem. Biol., 3:145-149 (1996)) and
problems in determining exact function. To explore these key properties, there is a
pressing need for methods that will not only allow the preparation of pure glycosylated
proteins, but will also allow the preparation of non-natural variants for the determination
of structure-function relationships, such as structure-activity relationships (SARs). The
few studies that have compared single glycoforms successfully have required abundant
sources and extensive chromatographic separation (Rudd et al., Biochem., 33:17-22
(1994)). Neoglycoproteins (Krantz et al., Biochem., 15:3963-3968 (1976)), formed via
unnatural linkages between sugars and proteins, provide an invaluable alternative source
of carbohydrate-protein conjugates (for reviews see Stowell et al., Adv. Carbohydr.
Chem. Biochem., 37:225-281 (1980); Neoglycoconjugates: Preparation and Applications,
Lee et al., Eds., Academic Press, London (1994); Abelson et al., Methods Enzymol., 242:
(1994); Lee et al., Methods Enzymol., 247: (1994); Bovin et al., Chem. Soc. Rev.,
24:413-421 (1995)). In particular, chemical glycosylation allows control of the glycan
structure and the nature of the sugar-protein bond. However, despite these advantages,
existing methods for their preparation (Stowell et al., Adv. Carbohydr. Chem. Biochem.,
37:225-281 (1980)) typically generate mixtures. In addition, these techniques may alter
the overall charge of the protein (Lemieux et al., J. Am. Chem. Soc., 97:4076-4083
(1975); Kobayashi et al., Methods Enzymol., 247:409-418 (1994)) or destroy the cyclic
nature of glycans introduced (Gray, Arch. Biochem. Biophys., 163:426-428 (1974)). For
example, the reductive amination of lactose with bovine serum albumin (BSA) caused
indiscriminate modification of lysine residues through the formation of acyclic amines
(Gray, Arch. Biochem. Biophys., 163:426-428 (1974)). Advances in the site-
specific glycosylation of BSA have been made (Davis et al., Tetrahedron Lett., 32:6793-
6796 (1991); Wong et al., Biochem. J., 300:843-850 (1994); Macindoe et al., J. Chem.
Soc., Chem. Commun., 847-848 (1998)). However, these methods rely upon modification
of an existing cysteine in BSA and, as such, allow no flexibility in the choice of
glycosylation site. Glycoproteins occur naturally as complex mixtures of differently
glycosylated forms which are difficult to separate. To explore their properties, there is a
need for homogeneous sources of carbohydrate-protein conjugates. Existing methods
typically generate product protein mixtures of poorly characterized composition, with
little or no control over the site or level of glycosylation.

The present invention is directed to overcoming these deficiencies. 

SUMMARY OF THE INVENTION

It is an object of the present invention to provide for novel glycosylated
proteins.

It is a further object of the invention to provide for novel glycosylated
proteins which have modified or improved functional characteristics.

It is a further object of the invention to provide for a method of producing
glycosylated proteins which have well defined properties, for example, by having
predetermined glycosylation patterns.

According to the present invention, a method is provided wherein the 
glycosylation pattern of a protein is modified in a predictable and repeatable manner. 
Generally, the modification of the protein occurs via reaction of a cysteine residue in the
protein with a glycosylated thiosulfonate. 

Thus, in one composition aspect of the present invention, a chemically 
modified mutant protein is provided, wherein said mutant protein differs from a precursor 
protein by virtue of having a cysteine residue substituted for a residue other than cysteine 
in said precursor protein, the substituted cysteine residue being subsequently modified by
reacting said cysteine residue with a glycosylated thiosulfonate. Preferably, the
glycosylated thiosulfonate is an alkylthiosulfonate, most preferably a
methanethiosulfonate.

In a method aspect of the present invention, a method of producing a
chemically modified mutant protein is provided comprising the steps of: (a) providing a
precursor protein; (b) substituting an amino acid residue other than cysteine in said
precursor protein with a cysteine; (c) reacting said substituted cysteine with a
glycosylated thiosulfonate, said glycosylated thiosulfonate comprising a carbohydrate
moiety; and (d) obtaining a modified glycosylated protein wherein said substituted
cysteine comprises a carbohydrate moiety attached thereto. Preferably, the glycosylated 
thiosulfonate is an alkylthiosulfonate, most preferably, a methanethiosulfonate. Also 
preferably, the substitution in said precursor protein is obtained by using recombinant 
DNA techniques by modifying a DNA encoding said precursor protein to comprise DNA 
encoding a cysteine at a desired location within the protein. 

The present invention also relates to novel glycosylated thiosulfonates. In
a preferred embodiment, the glycosylated thiosulfonate is a methanethiosulfonate. In a
most preferred embodiment, the glycosylated methanethiosulfonate comprises a chemical
structure including:

        O
        ||
  H3C - S - SR
        ||
        O

where R comprises -β-Glc, -Et-β-Gal, -Et-β-Glc, -Et-α-Glc, -Et-α-Man, -Et-Lac,
-β-Glc(Ac)2, -β-Glc(Ac)3, -β-Glc(Ac)4, -Et-α-Glc(Ac)2, -Et-α-Glc(Ac)3, -Et-α-Glc(Ac)4,
-Et-β-Glc(Ac)2, -Et-β-Glc(Ac)3, -Et-β-Glc(Ac)4, -Et-α-Man(Ac)3, -Et-α-Man(Ac)4,
-Et-β-Gal(Ac)3, -Et-β-Gal(Ac)4, -Et-Lac(Ac)5, -Et-Lac(Ac)6, or -Et-Lac(Ac)7.

Another aspect of the present invention is a method of modifying the 
functional characteristics of a protein including reacting the protein with a glycosylated 
thiosulfonate reagent under conditions effective to produce a glycoprotein with altered
functional characteristics as compared to the protein. Accordingly, the present invention 
provides for modified protein, wherein the protein comprises a wholly or partially 
predetermined glycosylation pattern which differs from the glycosylation pattern of the 
protein in its precursor, natural, or wild type state and a method for producing such a
modified protein. 

The present invention also relates to methods of determining the structure- 
function relationships of chemically modified mutant proteins. One method includes 
providing first and second chemically modified mutant proteins of the present invention, 
wherein the glycosylation pattern of the second chemically modified mutant protein 
differs from the glycosylation pattern of the first chemically modified mutant protein, 
evaluating a functional characteristic of the first and second chemically modified mutant 
proteins, and correlating the functional characteristic of the first and second chemically 
modified mutant proteins with the structures of the first and second chemically modified 
mutant proteins. Another method involves providing first and second chemically 
modified mutant proteins of the present invention, wherein at least one different cysteine 
residue in the second chemically modified mutant protein is modified by reacting said 
cysteine residue with a glycosylated thiosulfonate, evaluating a functional characteristic 
of the first and second chemically modified mutant proteins, and correlating the 
functional characteristic of the first and second chemically modified mutant proteins with 
the structures of the first and second chemically modified mutant proteins. 

The chemically modified mutant proteins of the present invention provide 
an alternative to site-directed mutagenesis and chemical modification for introducing 
unnatural amino acids into proteins. Moreover, the methods of the present invention 
allow the preparation of pure glycoproteins (i.e., not mixtures) with predetermined and 
unique structures. These glycoproteins can then be used to determine structure-function 
relationships (e.g., structure-activity relationships ("SARs")) of non-natural variants of 
the proteins. 

An advantage of the present invention is that it is possible to introduce 
predetermined glycosylation patterns into proteins in a simple and repeatable manner. 
This advantage provides an ability to modify critical protein characteristics such as 
partitioning, solubility, cell-cell signaling, catalytic activity, biological activity and 
pharmacological activity. Additionally, the methods of the present invention provide for 
a mechanism of "masking" certain chemically or biologically important protein sites, for 
example, sites which are critical for immunological or allergenic response or sites which
are critical to proteolytic degradation of the modified protein. 

Another advantage of the present invention is the ability to glycosylate a 
protein which is not generally glycosylated, or to modify the glycosylation pattern of a 
protein which is generally glycosylated.

Another advantage of the present invention is to produce enzymes which 
have altered catalytic activity. In one specific example, the inventors herein have shown 
that it is possible to modify the substrate specificity of a protease to increase the esterase 
activity as compared to the amidase activity. Similarly, modifications of substrate 
specificity would be expected when utilizing the present invention with other enzymes.

These and other advantages of the present invention are described in more 
detail in the following detailed description. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows dendrimer methanethiosulfonate ("MTS") reagents.

Figure 2 shows the synthesis of a first generation glycodendrimer reagent 
which bears two D-mannose units on its termini and has one arm as a MTS which can be 
attached to a subtilisin Bacillus lentus cysteine mutant. 

Figure 3 shows the synthesis of highly-functionalized glycodendrimer-protein
conjugates.

Figure 4 shows two parallel synthetic approaches to modification of 
subtilisin Bacillus lentus with dendrimers. Both approaches allow the use of a large 
library of methanethiosulfonate reagents (R-SSO2Me) to cap the dendrimeric branches.
The routes shown allow for the preparation of both dimeric and trimeric dendrimers.

Figure 5 shows peptide coupling catalyzed by an enzyme.

Figure 6 shows the preparation of two types of glycosylating reagents from
D-glucose (2a): the anomeric methanethiosulfonate 1a and the ethyl-tethered
methanethiosulfonates 1b, c, g, h.

Figure 7 shows the preparation of the α-D-manno-MTS reagents 1d and 1i,
which are epimeric at C-2 relative to 1b and 1g, respectively, and the β-D-galacto-MTS
reagents 1e and 1j, epimeric at C-4 relative to 1c and 1h, respectively.

Figure 8 shows the reaction of the thiol residue of cysteine introduced into
subtilisin Bacillus lentus with glycomethanethiosulfonate reagents.

Figure 9 shows the reaction of 2,3,4,6-tetra-O-acetyl-β-D-glucopyranosyl
methanethiosulfonate (1a) with subtilisin Bacillus lentus-N62C, -S156C, -S166C, and
-L217C.

Figures 10A-D show deprotected glycan structure-proteolytic activity
SARs of subtilisin Bacillus lentus cysteine mutants and glycosylated chemically modified
mutant enzymes ("CMMs") relative to wild-type ("WT"). A break in the axis indicates
that the value was not determined. At position 62, glycosylation partially reverses the
decrease in kcat/KM caused by mutation to cysteine (R = H) (Figure 10A). At position 217,
the 4-fold decrease in activity caused by mutation (R = H) is amplified to 6-fold lower
than WT by glycosylation with untethered S-β-Glc, but reduced to around 2.5-fold lower
than WT by glycosylation with ethyl-tethered glycans (b-f) (Figure 10B). At position
156, an arced variation in activity reaches a minimum kcat/KM that is 3-fold lower than WT
at bulky lacto-CMM S156C-S-f (Figure 10C). At position 166, the 2.5-fold decrease in
kcat/KM caused by mutation is amplified by glycosylation. kcat/KM decreases monotonically
from S166C (R = H) to a value that is 3.8-fold lower than WT for S166C-S-f (Figure
10D).

Figures 11A-D show the acetylated glycan structure-proteolytic activity
SARs of glycosylated chemically modified mutant enzymes relative to WT. For each
glycan the number of acetate groups present is indicated by a label on the corresponding
bar. A break in the axis indicates that the value was not determined. At positions 62
(Figure 11A), 217 (Figure 11B), and 166 (Figure 11D) an alternating trend in activity is
observed as a result of the opposite effects of acetylation upon kcat/KM according to
anomeric stereochemistry (see Figure 12). This results in a kcat/KM for N62C-S-g that is
1.1-fold higher than WT. At position 156 (Figure 11C), variations are slight and this is
consistent with its surface exposed orientation.

Figures 12A-D show the variation in proteolytic activity of glycosylated
chemically modified mutant enzymes of subtilisin Bacillus lentus upon acetylation of
glycans. Comparison of the activity of acetylated with fully deprotected chemically
modified mutant enzymes shows that at positions 62 (Figure 12A) and 217 (Figure 12B)
acetylation enhances the activity of α-tethered chemically modified mutant enzymes but
decreases that of β-tethered. In contrast, at position 166 (Figure 12D), acetylation
decreases the activity of α-tethered CMMs but increases that of β-tethered. Consistent
with its surface exposed orientation, changes at position 156 (Figure 12C) are modest.
For each glycan the number of acetate groups present is indicated by a label on the
corresponding bar. A break in the axis indicates that the value was not determined.

Figures 13A-D show esterase kcat/KM values for deprotected glyco-CMMs
relative to WT. At position 62 (Figure 13A), in the S2 pocket, glycosylation leads to a
series of enzymes that have similar activities that are 1.3- to 1.9-fold greater than WT.
At position 217 (Figure 13B), in the S1' pocket, glycosylation also increases kcat/KM to a
maximum 3.5-fold greater than WT for L217C-SEtGal (-e). At position 156 (Figure
13C), in the S1 pocket, glycosylation leads to a reduction in kcat/KM. At position 166
(Figure 13D), in the S1 pocket, the dramatic loss of activity upon mutation to cysteine (R
= H) is reversed by glycosylation. All five S166C deprotected glyco-CMMs have similar
kcat/KM values that are 1.1- to 1.4-fold lower than WT.

Figures 14A-D show the effect of acetylation on kcat/KM of glyco-CMMs.
For each glycan the number of acetate groups present is indicated by a label on the
corresponding bar. A break in the axis indicates that the value was not determined. At
positions 62 (Figure 14A), in the S2 pocket, and 217 (Figure 14B), in the S1' pocket, the
effect of acetylation is dependent on anomeric stereochemistry. At both sites, acetylation
of α-linked sugars (-b,-d) leads to an increase in kcat/KM whereas kcat/KM is decreased for
β-linked sugars (-c,-e,-f). kcat/KM also increases as the number of acetates increases.
Consistent with the surface-exposed nature of its side chain, acetylation at position 156
(Figure 14C), in the S1 pocket, has very little effect on kcat/KM. At position 166 (Figure
14D), in the S1 pocket, the effects of acetylation are opposite to those observed at
positions 62 and 217. Acetylation increases kcat/KM of β-linked glyco-CMMs (-c,-e),
while causing a decrease for the α-linked glyco-CMMs (-b,-d).

Figures 15A-D show the E/A of deprotected glyco-CMMs relative to WT.
A break in the axis indicates that the value was not determined. Glycosylation with
deprotected reagents 1b-f increases the E/A ratio in all cases. The greatest effects are
observed at positions 62, in the S2 pocket (Figure 15A), and 217, in the S1' pocket (Figure
15B), where the largest increases in E/A result in values up to 10.9-fold greater than WT.
At both sites the E/A is higher for the β-linked glyco-CMMs than the α-linked ones. At
positions 156 (Figure 15C) and 166 (Figure 15D), in the S1 pocket, there is little variation
in E/A.

Figures 16A-D show the effect of introducing acetylated glycans on E/A. 
For each glycan the number of acetate groups present is indicated by a label on the 
corresponding bar. A break in the axis indicates that the value was not determined. 
Values greater than zero indicate that acetylation increases E/A; negative values denote a
reduction in E/A upon acetylation.

Figure 17 shows the modeling of the high esterase activity of
L217C-S-Glc(Ac)3. A minimized structure of L217C-S-Glc(Ac)3 of the active site of SBL
showing the catalytic residues Ser221 and His64. The carbon atoms of the triacetylated
D-glucose moiety, which is bound to Leu217 via a disulfide bond, are numbered. The phenyl
ring of product AAPF occupies the S1 binding site and forms a crucial hydrogen bond (1.72 Å)
to Wat127. This water molecule is further stabilized by a second hydrogen bond (1.89 Å)
to the carbonyl O of the C-2 acetate group of glucose.

Figure 18 shows the proposed acyl-enzyme intermediate of
L217C-S-Glc(Ac)3. The carboxy terminus of AAPF forms a bond to the Oγ atom of Ser221.
Wat127, acting as the crucial deacylating nucleophilic water molecule, is stabilized
through its hydrogen bond to the carbonyl group of the C-2 acetate group.

DETAILED DESCRIPTION OF THE INVENTION
According to the present invention, a method is provided wherein the
glycosylation pattern of a protein is modified in a predictable and repeatable manner. 
Generally, the modification of the protein occurs via reaction of a cysteine residue in the 
protein with a glycosylated thiosulfonate. 

Thus, in one composition aspect of the present invention, a chemically 
modified mutant protein is provided, wherein said mutant protein differs from a precursor 
protein by virtue of having a cysteine residue substituted for a residue other than cysteine 
in said precursor protein, the substituted cysteine residue being subsequently modified by 
reacting said cysteine residue with a glycosylated thiosulfonate. Preferably, the
glycosylated thiosulfonate is an alkylthiosulfonate, most preferably, a 
methanethiosulfonate. 

In a method aspect of the present invention, a method of producing a
chemically modified mutant protein is provided comprising the steps of: (a) providing a
precursor protein; (b) substituting an amino acid residue other than cysteine in said
precursor protein with a cysteine; (c) reacting said substituted cysteine with a
glycosylated thiosulfonate, said glycosylated thiosulfonate comprising a carbohydrate
moiety; and (d) obtaining a modified glycosylated protein wherein said substituted
cysteine comprises a carbohydrate moiety attached thereto. Preferably, the glycosylated
thiosulfonate is an alkylthiosulfonate, most preferably, a methanethiosulfonate. Also
preferably, the substitution in said precursor protein is obtained by using recombinant
DNA techniques by modifying a DNA encoding said precursor protein to comprise DNA
encoding a cysteine at a desired location within the protein. The amino acid residues to
be substituted with cysteine residues according to the present invention may be replaced
using site-directed mutagenesis methods or other methods well known in the art. See, for
example, PCT Publication No. WO 95/10615, which is hereby incorporated by reference.
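
Step (b) above can be pictured at the sequence level with mutation labels of the kind used in this specification, such as N62C or S156C. The following is a minimal sketch, not the recombinant procedure itself: the function names are hypothetical, the toy sequence is invented, and positions are treated as simple 1-based indices into that sequence, which is an assumption that need not match the numbering convention of a real protein.

```python
import re

# Label format: original residue, position, replacement (e.g. "S156C").
_MUTATION = re.compile(r"^([A-Z])(\d+)([A-Z])$")


def parse_mutation(label: str) -> tuple[str, int, str]:
    """Split a label such as 'N62C' into ('N', 62, 'C')."""
    match = _MUTATION.match(label)
    if match is None:
        raise ValueError(f"unrecognized mutation label: {label!r}")
    original, position, replacement = match.groups()
    return original, int(position), replacement


def substitute_cysteine(sequence: str, label: str) -> str:
    """Return a copy of sequence with one non-cysteine residue replaced by Cys."""
    original, position, replacement = parse_mutation(label)
    if replacement != "C":
        raise ValueError("replacement residue must be cysteine (C)")
    if original == "C":
        raise ValueError("the residue being replaced must not already be cysteine")
    if not 1 <= position <= len(sequence):
        raise ValueError("position lies outside the sequence")
    if sequence[position - 1] != original:
        raise ValueError("label does not match the residue found at that position")
    return sequence[: position - 1] + "C" + sequence[position:]


# Invented five-residue sequence, used only to show the call.
print(substitute_cysteine("MNSLK", "S3C"))
```

The checks mirror the wording of step (b): the residue being replaced is something other than cysteine, and the replacement is cysteine at a chosen location.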

The present invention also relates to a glycosylated thiosulfonate.
Preferably, the glycosylated thiosulfonate comprises methanethiosulfonate. More
preferably, the methanethiosulfonate comprises the chemical structure:

        O
        ||
  H3C - S - SR
        ||
        O

where R comprises -β-Glc, -Et-β-Gal, -Et-β-Glc, -Et-α-Glc, -Et-α-Man, -Et-Lac,
-β-Glc(Ac)2, -β-Glc(Ac)3, -β-Glc(Ac)4, -Et-α-Glc(Ac)2, -Et-α-Glc(Ac)3, -Et-α-Glc(Ac)4,
-Et-β-Glc(Ac)2, -Et-β-Glc(Ac)3, -Et-β-Glc(Ac)4, -Et-α-Man(Ac)3, -Et-α-Man(Ac)4,
-Et-β-Gal(Ac)3, -Et-β-Gal(Ac)4, -Et-Lac(Ac)5, -Et-Lac(Ac)6, or -Et-Lac(Ac)7.

Another aspect of the present invention is a method of modifying the 
functional characteristics of a protein including providing a protein and reacting the 
protein with a glycosylated thiosulfonate reagent under conditions effective to produce a 
glycoprotein with altered functional characteristics as compared to the protein. 
Accordingly, the present invention provides for modified protein, wherein the protein
comprises a wholly or partially predetermined glycosylation pattern which differs from 
the glycosylation pattern of the protein in its precursor, natural, or wild type state and a 
method for producing such a modified protein. As used herein, glycosylation pattern 
means the composition of a carbohydrate moiety. The present invention also relates to
methods of determining the structure-function relationships of chemically modified 
mutant proteins. The first method includes providing first and second chemically 
modified mutant proteins of the present invention, wherein the glycosylation pattern of 
the second chemically modified mutant protein differs from the glycosylation pattern of 
the first chemically modified mutant protein, evaluating a functional characteristic of the
first and second chemically modified mutant proteins, and correlating the functional 
characteristic of the first and second chemically modified mutant proteins with the 
structures of the first and second chemically modified mutant proteins. The second 
method involves providing first and second chemically modified mutant proteins of the 
present invention, wherein at least one different cysteine residue in the second chemically
modified mutant protein is modified by reacting said cysteine residue with a glycosylated 
thiosulfonate, evaluating a functional characteristic of the first and second chemically 
modified mutant proteins, and correlating the functional characteristic of the first and 
second chemically modified mutant proteins with the structures of the first and second 
chemically modified mutant proteins.

The chemically modified mutant proteins of the present invention provide 
a valuable source of carbohydrate-protein conjugates. Moreover, the methods of the 
present invention allow the preparation of pure glycoproteins (i.e., not mixtures) with
predetermined and unique structures. These glycoproteins can then be used to determine
structure-function relationships (e.g., structure-activity relationships ("SARs")) of
non-natural variants of the proteins.

The protein of the invention may be any protein for which a modification 
of the glycosylation pattern thereof may be desirable. For example, proteins which are 
naturally not glycosylated may be glycosylated via the invention. Similarly, proteins 
which exist in a naturally glycosylated form may be modified so that the glycosylation
pattern confers improved or desirable properties to the protein. Specifically, proteins
useful in the present invention are those in which glycosylation plays a role in functional
characteristics such as, for example, biological activity, chemical activity,
pharmacological activity, or immunological activity.

WO 00/01712    PCT/US99/15138

Glycosylated proteins as referred to herein means moieties having 
carbohydrate components which are present on proteins, peptides, or amino acids. In the 
present invention, the glycosylation is provided, for example, as a result of reaction of the 
glycosylated thiosulfonate with the thiol hydrogen of a cysteine residue thereby producing 
an amino acid residue which has bound thereto the carbohydrate component present on 
the glycosylated thiosulfonate. 

The functional characteristics of a protein which may be altered by the 
present invention include, but are not limited to, enzymatic activity, the effect on a human
or animal body, the ability to act as a vaccine, the tertiary structure (i.e., how the protein 
folds), whether it is allergenic, its solubility, its signaling effects, its biological activity, 
and its pharmacological activity (Paulson, "Glycoproteins: What are the Sugar Chains 
For?", Trends in Biochem. Sciences , 14:272-276 (1989), which is hereby incorporated by 
reference). The use of glycosylated thiosulfonates as thiol-specific modifying reagents in
the method of the present invention allows virtually unlimited alterations of protein 
residues. In addition, this method allows the production of pure glycoproteins with 
predetermined and unique structures and, therefore, unique functional characteristics, with 
control over both the site and level of glycosylation. In particular, the method of 
modifying the functional characteristics of a protein allows the preparation of single
glycoforms through regio- and glycan-specific protein glycosylation at predetermined 
sites. Such advantages provide an array of options with respect to modification of protein 
properties which did not exist in the prior art. The ability to produce proteins having very 
specific and predictable glycosylation patterns will enable the production of proteins 
which have known and quantifiable effects in chemical, pharmaceutical, immunological,
or catalytic performance. For example, with knowledge of a specific problematic epitope, 
it would be possible to construct a modified protein according to the present invention in
which the epitope is masked by a carbohydrate moiety, thus reducing its allergenic or 
immunogenic response in a subject. As another example, where the solubility of a protein 
is problematic in terms of recovery or formulation in a pharmaceutical or industrial 
application, it may be possible, utilizing the present invention, to produce a protein which 
has altered solubility profiles thus producing a more desirable protein product. As 
another example, if a protein has a particular problem of being proteolytically unstable in
the environment in which it is to be used, then it may be possible to mask the proteolytic 
cleavage sites in the protein using the present invention to cover up such a site with a 
carbohydrate moiety. These examples are merely a few of the many applications of the 
present invention to produce improved proteins. 

In a preferred embodiment, the protein is an enzyme. The term "enzyme" 
includes proteins that are capable of catalyzing chemical changes in other substances 
without being changed themselves. The enzymes can be wild-type enzymes or variant 
enzymes. Enzymes within the scope of the present invention include pullulanases,
proteases, cellulases, amylases, isomerases, lipases, oxidases, and reductases. Preferably,
the enzyme is a protease. The enzyme can be a wild-type or mutant protease. Wild-type
proteases can be isolated from, for example, Bacillus lentus or Bacillus amyloliquefaciens
(also referred to as BPN'). Mutant proteases can be made according to the teachings of,
for example, PCT Publication Nos. WO 95/10615 and WO 91/06637, which are hereby 
incorporated by reference. Functional characteristics of enzymes which are suitable for 
modification according to the present invention include, for example, enzymatic activity, 
solubility, partitioning, cell-cell signaling, substrate specificity, substrate binding, 
stability to temperature and reagents, ability to mask an antigenic site, physiological 
functions, and pharmaceutical functions (Paulson, "Glycoproteins: What are the Sugar 
Chains For?", Trends in Biochem. Sciences, 14:272-276 (1989), which is hereby
incorporated by reference).

The protein is modified so that a non-cysteine residue is substituted with a 
cysteine residue, preferably by recombinant means. Preferably, the amino acids replaced 
in the protein by cysteines are selected from the group consisting of asparagine, leucine, 



The terms "thiol side chain group," "thiol containing group," and "thiol
side chain" are terms which can be used interchangeably and include groups that are
used to replace the thiol hydrogen of a cysteine used to replace one of the amino acids in a 
protein. Commonly, the thiol side chain group includes a sulfur through which the thiol 
side chain groups defined above are attached to the thiol sulfur of the cysteine. 

The glycosylated thiosulfonates of the invention are those which are 
capable of reacting with a thiol hydrogen of a cysteine to produce a glycosylated amino 
acid residue. By glycosylated is meant that the thiosulfonate has bound thereto a sugar or 
carbohydrate moiety which can be transferred to a protein pursuant to the present 
invention. Preferably, the glycosylated thiosulfonates are glycosylated 
alkylthiosulfonates, most preferably, glycosylated methanethiosulfonates. Such
glycosylated methanethiosulfonates have the general formula:

        O
        ||
  H3C - S - SR
        ||
        O

In a particularly preferred embodiment, the methanethiosulfonate comprises
an R group which comprises: -β-Glc, -Et-β-Gal, -Et-β-Glc, -Et-α-Glc, -Et-α-Man,
-Et-Lac, -β-Glc(Ac)2, -β-Glc(Ac)3, -β-Glc(Ac)4, -Et-α-Glc(Ac)2, -Et-α-Glc(Ac)3,
-Et-α-Glc(Ac)4, -Et-β-Glc(Ac)2, -Et-β-Glc(Ac)3, -Et-β-Glc(Ac)4, -Et-α-Man(Ac)3,
-Et-α-Man(Ac)4, -Et-β-Gal(Ac)3, -Et-β-Gal(Ac)4, -Et-Lac(Ac)5, -Et-Lac(Ac)6, or
-Et-Lac(Ac)7.

In a preferred embodiment, the carbohydrate moiety of the present 
invention is a dendrimer moiety. Multiple functionalization of chemically modified 
mutant proteins can be achieved by dendrimer approaches, whereby multiple-branched 
linking structures can be employed to create poly-functionalized chemically modified 
mutant proteins. 

Highly branched molecules or dendrimers were first synthesized by Vögtle
in 1978 (Buhleier et al., Synthesis, 155-158 (1978), which is hereby incorporated by
reference). The attachment of identical building blocks that contain branching sites to a
central core may be achieved with a high degree of homogeneity and control. Each
branch contains a functional group which, after chemical alteration, may be connected to
yet another branching building block. In this manner, layer after layer of branching
rapidly generates highly functionalized molecules.

For instance, multiple glycosylation, including multiple mannose- 
containing chemically modified mutant proteins, and varied sugar moieties, can be 
created. The dendrimer reagent structures would include methanethiosulfonates with 
simple branching such as: 

[Structure not reproduced: a simply branched reagent bearing a CH3SO2S- group]

derived from pentaerythritol, to very complex branched dendrimer reagents (see Figure 1).

In particular, a first generation glycodendrimer reagent is synthesized as 
shown in Figure 2. This approach can be extended to cover larger dendrimers. More 
specifically, by leaving one "arm" of the glycodendrimer free for conversion to a 
methanethiosulfonate, the remaining arms can be further branched to synthesize highly
functionalized glycodendrimer reagents as shown in Figure 3. Through further branching
and by using different carbohydrates, this concept can be extended to virtually unlimited 
levels. 
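
The growth in the number of carbohydrate attachment points with each layer of branching can be illustrated by a short calculation. The following is only a sketch under stated assumptions: it assumes an idealized pentaerythritol-type building block with four arms, one of which is always consumed (by the methanethiosulfonate group at the core, or by the link to the preceding layer), and it assumes complete, defect-free branching. The function name and default values are hypothetical and are not taken from the Figures.

```python
def terminal_sites(generation, arms_per_unit=4):
    """Idealized number of free arms after `generation` layers of branching.

    Assumption: every building block has `arms_per_unit` arms and spends
    exactly one of them on its link toward the core, so each layer
    multiplies the number of free arms by (arms_per_unit - 1).
    """
    if generation < 1:
        raise ValueError("generation must be at least 1")
    free_arms_per_unit = arms_per_unit - 1
    return free_arms_per_unit ** generation


if __name__ == "__main__":
    # First, second, and third generation reagents under these assumptions.
    for g in (1, 2, 3):
        print(f"generation {g}: {terminal_sites(g)} carbohydrate sites")
```

Under these assumptions the number of sites triples with each generation, which is why a small number of branching steps is sufficient to reach highly functionalized reagents.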

A flexible synthetic strategy for the synthesis of core dendrimeric 
methanethiosulfonate building blocks that may be used either in situ or before
modification to construct dendrimers is shown in Figure 4.

The present invention also relates to glycosylated thiosulfonate 
compositions. Preferably the glycosylated thiosulfonates are methanethiosulfonates and
comprise a chemical structure:

      O
      ||
  H3C-S-S-R
      ||
      O

wherein R comprises -β-Glc, -Et-β-Gal, -Et-β-Glc, -Et-α-Glc, -Et-α-Man, -Et-Lac,
-β-Glc(Ac)2, -β-Glc(Ac)3, -β-Glc(Ac)4, -Et-α-Glc(Ac)2, -Et-α-Glc(Ac)3, -Et-α-Glc(Ac)4,
-Et-β-Glc(Ac)2, -Et-β-Glc(Ac)3, -Et-β-Glc(Ac)4, -Et-α-Man(Ac)3, -Et-α-Man(Ac)4,
-Et-β-Gal(Ac)3, -Et-β-Gal(Ac)4, -Et-Lac(Ac)5, -Et-Lac(Ac)6, or -Et-Lac(Ac)7.

The present invention also relates to a method of determining the structure-function
relationships of chemically modified mutant proteins. This method involves
providing first and second chemically modified mutant proteins of the present invention,
wherein the glycosylation pattern of the second chemically modified mutant protein is
different from the glycosylation pattern of the first chemically modified mutant protein,
evaluating a functional characteristic of the first and second chemically modified mutant
proteins, and correlating the functional characteristic of the first and second chemically
modified mutant proteins with the structures of the first and second chemically modified
mutant proteins.

Evaluating a functional characteristic of the first and second chemically
modified mutant proteins includes testing for functional characteristics including, but not
limited to, stability to temperature and reagents, solubility, partitioning, enzymatic
activity, cell-cell signaling, substrate specificity, substrate binding, ability to mask an
antigenic site, physiological functions, and pharmaceutical functions (Paulson,
"Glycoproteins: What are the Sugar Chains For?", Trends in Biochem. Sciences, 14:272-276
(1989), which is hereby incorporated by reference).

Another aspect of the present invention is a second method of determining
the structure-function relationships of chemically modified mutant proteins. This method
involves providing first and second chemically modified mutant proteins of the present
invention, wherein at least one different cysteine residue in the second chemically
modified mutant protein is modified by reacting said cysteine residue with a glycosylated
thiosulfonate, evaluating a functional characteristic of the first and second chemically
modified mutant proteins, and correlating the functional characteristic of the first and
second chemically modified mutant proteins with the structures of the first and second
chemically modified mutant proteins.

By way of example to illustrate some of its advantages, the following
discussion will focus on certain proteases which are modified according to the methods of
the present invention. Alkaline serine proteases (subtilisins) are finding increasing use in
biocatalysis, particularly in chiral resolution, regioselective acylation of polyfunctional
compounds, peptide coupling, and glycopeptide synthesis. As shown in Figure 5,
subtilisins can catalyze peptide bond formation starting from an ester substrate, by first
forming an acyl enzyme intermediate which then reacts with a primary amine to form the
peptide product. This application requires high esterase activity to promote acyl enzyme
formation and low amidase activity to minimize hydrolysis of the peptide bond of the
desired product. Generally, subtilisins do not meet these requirements. However, the
improvement of the esterase to amidase selectivities of subtilisins has been a long-sought
goal. By using the methods provided for in the present invention, it is possible to
produce subtilisins which have advantageous properties.

The inventors in the present case used site specific mutagenesis to modify
certain residues and introduce additional cysteine residues within subtilisin which would
then serve to react with a glycosylated methanethiosulfonate to produce a glycosylation
point at the introduced cysteine. Bacillus lentus subtilisin was selected for illustrative
purposes because it does not contain a natural cysteine and is not naturally glycosylated.

The substrate binding site of an enzyme consists of a series of subsites
across the surface of the enzyme. The portions of the substrate that correspond to the
subsites are labeled P and the subsites are labeled S. By convention, the subsites are
labeled S1, S2, S3, S4, S1', and S2'. A discussion of subsites can be found in Berger et al.,
Phil. Trans. Roy. Soc. Lond. B, 257:249-264 (1970), Siezen et al., Protein Engineering,
4:719-737 (1991), and Fersht, Enzyme Structure and Mechanism, 2 ed., Freeman: New York,
29-30 (1985), which are hereby incorporated by reference.

In the present illustration, the S1, S1', or S2 subsites were selected as
suitable targets for modification. In particular, the amino acids corresponding to N62,
L217, S156, and S166 in naturally-occurring subtilisin from Bacillus amyloliquefaciens
or to equivalent amino acid residues in other subtilisins, such as Bacillus lentus subtilisin,
were selected for modification to cysteine. The mutated subtilisin was produced through
standard site directed mutagenesis techniques and the obtained mutant subtilisin was
reacted with certain glycosylated alkylthiosulfonates, particularly glycosylated
methanethiosulfonates, as provided in the examples appended hereto.

Enzymatic peptide coupling is an attractive method for preparation of a
variety of peptides, because this method requires minimal protection of the substrate,
proceeds under mild conditions, and does not cause racemization (Wong et al., Enzymes
in Synthetic Organic Chemistry, Pergamon Press: Oxford, 41-130 (1994), which is hereby
incorporated by reference). In spite of these advantages, two major problems have limited
the use of serine proteases in peptide synthesis. One is their efficient proteolytic
(amidase) activity which causes hydrolysis of the coupling product, and the other is their
stringent structural specificity and stereospecificity.

Surprisingly, it was found that the chemically modified mutant subtilisins
of the present invention have altered esterase-to-amidase activity as compared to the
precursor enzyme. Increasing the esterase-to-amidase ratio enables the use of the enzyme
to more efficiently catalyze peptide synthesis. In particular, subtilisins can catalyze
peptide bond formation starting from an ester substrate (i.e. an acyl donor), by first
forming an acyl enzyme intermediate which then reacts with a primary amine (i.e. an acyl
acceptor) to form the peptide product, as shown in Figure 5. This reaction thus requires
high esterase activity to promote acyl enzyme formation and, then, low amidase activity
to minimize hydrolysis of the peptide bond of the desired product. The chemically
modified mutant subtilisins produced according to the present invention show an
increased esterase-to-amidase ratio, without reducing the absolute esterase activity of the
enzyme. In addition, certain modified enzymes of the present invention even show a
concomitant increase in the absolute esterase activity.
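
The comparison described above can be sketched numerically. The following is an illustrative sketch only: it assumes that esterase and amidase activities are each expressed as a single positive number in the same units (for example a specificity constant), and every number in it is hypothetical and is not a measured value from this application. The function names are likewise hypothetical.

```python
def esterase_to_amidase_ratio(esterase_activity, amidase_activity):
    """Ratio of esterase activity to amidase activity (same units for both)."""
    if amidase_activity <= 0:
        raise ValueError("amidase activity must be positive")
    return esterase_activity / amidase_activity


def fold_change(modified_ratio, precursor_ratio):
    """How many times larger the modified enzyme's ratio is than the precursor's."""
    return modified_ratio / precursor_ratio


if __name__ == "__main__":
    # Hypothetical activities, chosen only to illustrate the arithmetic.
    precursor = esterase_to_amidase_ratio(3500.0, 200.0)
    modified = esterase_to_amidase_ratio(4200.0, 60.0)
    print("precursor ratio:", precursor)
    print("modified ratio:", modified)
    print("fold change:", fold_change(modified, precursor))
```

In this hypothetical case the modified enzyme has both a higher absolute esterase activity and a lower amidase activity, so the ratio rises for two independent reasons; keeping the two activities as separate inputs makes it possible to see which one drives a change.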

Therefore, an unexpected benefit of subtilisins which are modified
according to the present invention is that they can be used in organic synthesis to, for
example, catalyze a desired reaction and/or favor a certain stereoselectivity. See e.g.,
Noritomi et al., Biotech. Bioeng. 51:95-99 (1996); Dabulis et al., Biotech. Bioeng. 41:566-571
(1993), and Fitzpatrick et al., J. Am. Chem. Soc. 113:3166-3171 (1991), which are
hereby incorporated by reference.

Proteins obtained using the methods provided herein may be used in any
application in which it is desired to use such proteins, where having modified functional
capabilities is advantageous. Thus, proteins modified as provided herein may be used in
the medical field for pharmaceutical compositions and in diagnostic preparations.
Additionally, proteins such as enzymes which are modified according to the present
invention may be used in applications which are generally known for such enzymes
including industrial applications such as cleaning products, textile processing, feed
modification, food modification, brewing of grain beverages, starch processing, as
antimicrobials, and in personal care formulations. Moreover, the unique functionalities
made possible by the present invention may result in uses for proteins which have not
heretofore been recognized as feasible.

EXAMPLES 

Example 1 - Preparation of Methanethiosulfonate ("MTS") Reagents

The preparation of NaSSO2CH3 (Kenyon et al., Methods Enzymol.,
47:407-430 (1977), which is hereby incorporated by reference) has been described
previously (Berglund et al., J. Am. Chem. Soc., 119:5265-5266 (1997), which is hereby
incorporated by reference). Acetobromoglucose (3) (See Figure 6) (prepared from
D-glucose according to Scheurer et al., J. Am. Chem. Soc., 76:3224 (1954), which is hereby
incorporated by reference) in 73% yield, pentaacetylglucose (prepared from the
corresponding parent carbohydrates according to the method of Verley et al., Ber. Dtsch.
Chem. Ges., 34:3354-3358 (1901), which is hereby incorporated by reference, and
purified by flash chromatography) in 99% yield, 5d (See Figure 7) (prepared from the
corresponding parent carbohydrates according to the method of Verley et al., Ber. Dtsch.
Chem. Ges., 34:3354-3358 (1901), which is hereby incorporated by reference, and
purified by flash chromatography) in 92% yield, 5e (See Figure 7) (prepared from the
corresponding parent carbohydrates according to the method of Verley et al., Ber. Dtsch.
Chem. Ges., 34:3354-3358 (1901), which is hereby incorporated by reference, and
purified by flash chromatography) in 99% yield, 5f (See Figure 7) (prepared from lactose
according to the method of Hudson et al., J. Am. Chem. Soc., 37:1270-1275 (1915),
which is hereby incorporated by reference, and purified by flash chromatography in 82%
yield) were prepared according to literature methods. N,N-dimethylformamide ("DMF")
was distilled under N2 from CaH2 and stored over molecular sieves under N2 before use.
Methanol was distilled from Mg/I2 under N2 immediately prior to use. Br(CH2)2OH was
stood over and distilled from CaO under reduced pressure and stored under N2 prior to
use. All other chemicals were used as received from Sigma-Aldrich (St. Louis, MO) or
Baker (Phillipsburg, NJ). All flash chromatography was performed using silica gel
(Whatman, 60 Å, 230-400 Mesh, Clifton, NJ). Melting points were determined using an
Electrothermal IA9000 series digital melting point apparatus and are uncorrected. IR
spectra were recorded on Bomem MB or Perkin-Elmer FTIR Spectrum 1000
spectrophotometers. 1H NMR and 13C NMR spectra were recorded on Varian Gemini
200, Unity 400 or Unity 500 NMR spectrometers at the frequencies indicated. Where
indicated, NMR peak assignments were made using Correlation Spectroscopy ("COSY")
or Distortionless Enhancement by Polarization Transfer ("DEPT") experiments; all others are
subjective. All chemical shifts were referenced to residual solvent as an internal standard;
for 13C NMR in D2O, 1,4-dioxane (67.6 ppm) was used. ES-MS data were acquired using a
PE SCIEX API III Biomolecular mass spectrometer. All HRMS data were acquired using
Micromass 70-250S or Micromass ZAB-SE mass spectrometers according to the
ionization methods indicated. Solvents were removed in vacuo.

Preparation of 2,3,4,6-Tetra-O-acetyl-β-D-glucopyranosyl methanethiosulfonate (1a).

Initial approaches to untethered glyco-MTS reagents similar in type to 1a
(See Figure 8) were based upon Danishefsky's glycal methodology (Halcomb et al., J.
Am. Chem. Soc., 111:6661-6666 (1989), which is hereby incorporated by reference).
Tris-TBS glucal was prepared according to the method of Lesimple et al., Tetrahedron
Lett., 27:6201-6204 (1986), which is hereby incorporated by reference, and oxidized to
tris-TBS protected 1,2-anhydroglucose using dimethyldioxirane. However, under a
variety of conditions and in contrast to the behavior of other sulfur nucleophiles (Gordon
et al., Carbohydr. Res., 206:361-366 (1990); Berkowitz et al., J. Am. Chem. Soc.,
114:4518-4529 (1992), which are hereby incorporated by reference),
methanethiosulfonate ion failed to open the epoxide moiety of tris-TBS protected
1,2-anhydro-D-glucose. Deprotection of 1a (See Figure 8) was attempted under a variety of
conditions, but in all cases led only to decomposition or hydrolysis of the thioglucosidic
bond (Zemplen et al., Ber. Dtsch. Chem. Ges., 56:1705-1710 (1923); Plattner et al., J.
Am. Chem. Soc., 94:8613-8615 (1972); Mori et al., Tetrahedron Lett., 20:1329-1332
(1979); Herzig et al., Carbohydr. Res., 153:162-167 (1986); Herzig et al., J. Org. Chem.,
51:727-730 (1986); Vekemans et al., Tetrahedron Lett., 28:2299-2300 (1987); Cinget et
al., Synlett, 168-170 (1993), which are hereby incorporated by reference).

Acetobromoglucose (3) (See Figure 6) (1 g, 2.43 mmol) was added to a
solution of NaSSO2CH3 (380 mg, 2.84 mmol) in ethanol (4 mL) at 90°C under N2. After
20 minutes the resulting suspension was cooled and the solvent removed. The residue
was purified by flash chromatography (EtOAc : hexane, 9:11) and the resulting solid
recrystallized from ether to give 1a (See Figure 8) (674 mg, 63%) as a white solid; mp
151-152°C, melts then decomp. (ether); [α]27D = -19.0 (c 1.24, CHCl3); IR (KBr) 1749 cm-1
(C=O), 1333, 1140 cm-1 (S-SO2); 1H NMR (400 MHz, CDCl3) δ 2.00, 2.04, 2.06, 2.07 (s
x 4, 3H x 4, Ac x 4), 3.44 (s, 3H, CH3SO2-), 3.82 (ddd, J4,5 10.1 Hz, J5,6 5.9 Hz, J5,6' 2.2
Hz, 1H, H-5), 4.08 (dd, J5,6 5.9 Hz, J6,6' 12.5 Hz, 1H, H-6), 4.31 (dd, J5,6' 2.2 Hz, J6,6' 12.5
Hz, 1H, H-6'), 5.05 (t, J 9.8 Hz, 1H, H-4), 5.07 (dd, J1,2 10.5 Hz, J2,3 9.4 Hz, 1H, H-2),
5.25 (d, J1,2 10.5 Hz, 1H, H-1), 5.29 (t, J 9.3 Hz, 1H, H-3); 13C NMR (50 MHz, CDCl3) δ
20.5, 20.7 (CH3COO- x 4), 52.8 (CH3SO2-), 61.8, 68.0, 68.7, 73.3, 76.6 (C-2, C-3, C-4,
C-5, C-6), 86.4 (C-1), 169.3, 169.3, 169.7, 170.1 (CH3COO- x 4); HRMS m/z (EI+):
Found 443.0636 (M+H+); C15H23O11S2 requires 443.0682.
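
The reported yield of 1a can be checked from the quantities given in this preparation. The following is a minimal sketch, assuming that acetobromoglucose (2.43 mmol) is the limiting reagent and that neutral 1a has the formula C15H22O11S2 (inferred here by removing one hydrogen from the M+H formula reported in the HRMS data). The atomic weights are standard rounded values and the function names are hypothetical.

```python
# Standard atomic weights in g/mol, rounded (assumed values for this sketch).
ATOMIC_WEIGHT = {"C": 12.011, "H": 1.008, "O": 15.999, "S": 32.06}


def molar_mass(formula):
    """Molar mass in g/mol for a formula given as {element: count}."""
    return sum(ATOMIC_WEIGHT[element] * count for element, count in formula.items())


def percent_yield(product_mg, product_formula, limiting_mmol):
    """Percent yield from isolated mass (mg) and limiting reagent amount (mmol)."""
    product_mmol = product_mg / molar_mass(product_formula)  # mg / (g/mol) = mmol
    return 100.0 * product_mmol / limiting_mmol


# Neutral 1a, inferred from the reported (M+H) formula C15H23O11S2.
FORMULA_1A = {"C": 15, "H": 22, "O": 11, "S": 2}

if __name__ == "__main__":
    print(f"molar mass of 1a: {molar_mass(FORMULA_1A):.2f} g/mol")
    print(f"yield of 1a: {percent_yield(674, FORMULA_1A, 2.43):.1f}%")
```

With these inputs the calculation agrees with the 63% yield stated above to the nearest percent. The same two functions can be reused for the other preparations in this Example by extending the atomic weight table (for instance with bromine) and supplying the appropriate formula.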

Preparation of 2-(2,3,4,6-Tetra-O-acetyl-α-D-glucopyranosyl)ethyl methanethiosulfonate
(1g).

BF3·Et2O (145 µL, 1.1 mmol) was added dropwise to a suspension of
D-glucose (2a) (See Figure 6) (1.45 g, 8.1 mmol) in Br(CH2)2OH (19 mL) under N2 and the
resulting mixture heated to 105°C. After 8 hours, the resulting solution was cooled and
the solvent removed. The residue was dissolved in Ac2O/pyridine (2:3 v/v, 16 mL) under
N2. After a further 24 hours, the reaction solvent was removed and the residue purified by
repeated flash chromatography (EtOAc then EtOAc : hexane, 3:7) to give 2-bromoethyl
2,3,4,6-tetra-O-acetyl-α-D-glucopyranoside (4g) (See Figure 6) (1.76 g, 48%) as a
colorless oil that crystallized on standing to give a white solid; mp 86-88°C; [α]25D =
+130.6 (c 0.21, CHCl3); IR (film) 1749 cm-1 (C=O); 1H NMR (400 MHz, CDCl3) δ 2.01,
2.03, 2.07, 2.09 (s x 4, 3H x 4, Ac x 4), 3.51 (t, J 5.9 Hz, 2H, -CH2Br), 3.83 (dt, Jd 11.6
Hz, Jt 5.8 Hz, 1H, -OCHH'-), 3.96 (dt, Jd 11.6 Hz, Jt 5.8 Hz, 1H, -OCHH'-), 4.10 (dd, J5,6
2.2 Hz, J6,6' 12.0 Hz, 1H, H-6), 4.14 (ddd, J4,5 10.2 Hz, J5,6 2.2 Hz, J5,6' 4.4 Hz, 1H, H-5),
4.24 (dd, J5,6' 4.4 Hz, J6,6' 12.0 Hz, 1H, H-6'), 4.84 (dd, J1,2 3.8 Hz, J2,3 10.3 Hz, 1H, H-2),
5.05 (t, J 9.7 Hz, 1H, H-4), 5.14 (d, J1,2 3.8 Hz, 1H, H-1), 5.49 (dd, J2,3 10.3 Hz, J3,4 9.5
Hz, 1H, H-3); 13C NMR (100 MHz, CDCl3) δ 20.6, 20.7 (CH3COO- x 4), 29.9 (-CH2Br),
61.9, 67.8, 68.5, 68.8, 70.0, 70.8 (-OCH2-, C-2, C-3, C-4, C-5, C-6), 96.0 (C-1), 169.6,
170.0, 170.2, 170.6 (CH3COO- x 4); HRMS m/z (FAB+): Found 477.0381 (M+Na+);
C16H23O10BrNa requires 477.0372. NaSSO2CH3 (75 mg, 0.56 mmol) was added to a
solution of 4g (See Figure 6) (190 mg, 0.42 mmol) in DMF (6 mL) under N2 and warmed
to 50°C. After 21 hours, the solution was cooled and the solvent removed. The residue
was purified by flash chromatography (EtOAc : hexane, 1:1) to give 1g (See Figure 6)
(183 mg, 90%) as a colorless oil; [α]27D = +92.1 (c 0.39, CHCl3); IR (film) 1748 cm-1
(C=O), 1322, 1134 cm-1 (S-SO2); 1H NMR (400 MHz, CDCl3) δ 2.01, 2.03, 2.07, 2.09 (s x
4, 3H x 4, Ac x 4), 3.41 (t, J 5.7 Hz, 2H, -CH2S-), 3.41 (s, 3H, CH3SO2-), 3.75 (dt, Jd 10.8
Hz, Jt 5.7 Hz, 1H, -OCHH'-), 3.99-4.06 (m, 2H, H-5, -OCHH'-), 4.09 (dd, J5,6 2.4 Hz, J6,6'
12.6 Hz, 1H, H-6), 4.25 (dd, J5,6' 4.6 Hz, J6,6' 12.6 Hz, 1H, H-6'), 4.87 (dd, J1,2 3.9 Hz,
J2,3 10.3 Hz, 1H, H-2), 5.06 (t, J 9.8 Hz, 1H, H-4), 5.12 (d, J1,2 3.9 Hz, 1H, H-1), 5.43 (t, J 9.8
Hz, 1H, H-3); 13C NMR (100 MHz, CDCl3) δ 20.6, 20.6, 20.7, 20.7 (CH3COO- x 4), 36.0
(-CH2S-), 50.8 (CH3SO2-), 61.8 (-OCH2-), 67.0, 67.8, 68.3, 69.8, 70.7 (C-2, C-3, C-4, C-5,
C-6), 96.0 (C-1), 169.5, 170.0, 170.6 (CH3COO- x 4); HRMS m/z (FAB+): Found
487.0946 (M+H+); C17H27O12S2 requires 487.0944.
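
The "requires" values quoted with the HRMS data can be reproduced by summing monoisotopic atomic masses over the ion formula, and the agreement with the "found" value can be expressed in parts per million. The following is a sketch under stated assumptions: the isotope masses are standard literature values for the most abundant isotopes (with 79Br for bromine), no electron-mass correction is applied (an assumption that happens to reproduce the values quoted in this Example), and the function names are hypothetical.

```python
# Monoisotopic masses (u) of the most abundant isotopes; 79Br is assumed for Br.
MONOISOTOPIC_MASS = {
    "C": 12.0,
    "H": 1.00782503,
    "O": 15.99491462,
    "S": 31.972071,
    "Br": 78.9183371,
    "Na": 22.9897693,
}


def ion_mass(formula):
    """Sum of monoisotopic atomic masses for an ion formula {element: count}.

    Assumption: the mass of the missing electron is neglected.
    """
    return sum(MONOISOTOPIC_MASS[element] * count for element, count in formula.items())


def ppm_error(found, required):
    """Deviation of the measured m/z from the calculated m/z, in ppm."""
    return 1.0e6 * (found - required) / required


if __name__ == "__main__":
    # (M+H) ion of 1g, formula as quoted in the text: C17H27O12S2.
    required = ion_mass({"C": 17, "H": 27, "O": 12, "S": 2})
    print(f"calculated: {required:.4f}")
    print(f"error vs. found 487.0946: {ppm_error(487.0946, required):.1f} ppm")
```

The same function applies to the sodium adducts quoted in this Example, for instance C16H23O10BrNa for the (M+Na) ion of 4g.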


Preparation of 2-(α-D-Glucopyranosyl)ethyl methanethiosulfonate (1b).

A solution of NaOMe (0.1 M, 0.3 mL) was added to a suspension of 4g
(See Figure 6) (300 mg, 0.66 mmol) in MeOH (3 mL) under N2 and stirred vigorously.
After 6 hours, the resulting solution was passed through a Dowex 50W(H+) plug (2 x 1
cm, eluant MeOH) and the solvent removed to give 2-bromoethyl α-D-glucopyranoside
bromide (4b) (See Figure 6) (178 mg, 94%) (The synthesis of 4b as an intermediate has
been described previously. However, this method gave only a poor yield of product.
Nagai et al., Carbohydr. Res., 190:165-180 (1989), which is hereby incorporated by
reference) as a white solid. NaSSO2CH3 (100 mg, 0.75 mmol) was added to a solution of
4b (See Figure 6) (178 mg, 0.62 mmol) in DMF (7 mL) under N2 and warmed to 50°C.
After 25 hours, the solution was cooled and the solvent removed. The residue was
purified by flash chromatography (MeOH : EtOAc, 1:9) to give 1b (See Figure 6) (144
mg, 73%) as a hygroscopic foam; [α]27D = +109.9 (c 1.11, H2O); IR (film) 3423 cm-1
(OH), 1309, 1128 cm-1 (S-SO2); 1H NMR (500 MHz, D2O, COSY) δ 3.16 (t, J 9.5 Hz, 1H,
H-4), 3.28 (t, J 5.9 Hz, 2H, -CH2S-), 3.30 (s, 3H, CH3SO2-), 3.31 (dd, J1,2 3.8 Hz, J2,3 9.9
Hz, 1H, H-2), 3.44 (t, J 9.5 Hz, 1H, H-3), 3.47-3.53 (m, 2H, H-6, H-6'), 3.58-3.61 (m,
1H, H-5), 3.62 (dt, Jt 5.4 Hz, Jd 10.8 Hz, 1H, -OCHH'-), 3.79 (dt, Jt 6.3 Hz, Jd 10.8 Hz,
1H, -OCHH'-), 4.72 (d, J1,2 3.8 Hz, 1H, H-1); 13C NMR (100 MHz, D2O) δ 36.7 (-CH2S-),
50.7 (CH3SO2-), 61.5 (-OCH2-), 67.2, 70.5, 72.3, 73.2, 74.0 (C-2, C-3, C-4, C-5, C-6),
99.4 (C-1); HRMS m/z (FAB+): Found 319.0517 (M+H+); C9H19O8S2 requires 319.0521.


Preparation of 2-(2,3,4,6-Tetra-O-acetyl-β-D-glucopyranosyl)ethyl methanethiosulfonate
(1h).

BF3·Et2O (3.3 mL, 26.0 mmol) was added dropwise over the course of 15
minutes to a solution of 1,2,3,4,6-penta-O-acetyl-α,β-D-glucose (2 g, 5.1 mmol) and
Br(CH2)2OH (0.45 mL, 6.3 mmol) in CH2Cl2 (9 mL) at 0°C under N2. After 1.5 hours,
the solution was warmed to room temperature. After 20 hours the reaction solution was
added to ice water (15 mL) and extracted with CH2Cl2 (15 mL x 3). These extracts were
combined, washed with water (15 mL), sat. NaHCO3 (aq., 15 mL), water (15 mL), dried
(MgSO4), filtered, and the solvent removed. The residue was purified by flash
chromatography (EtOAc : hexane, 1:3) to give 2-bromoethyl 2,3,4,6-tetra-O-acetyl-β-D-glucopyranoside
(4h) (See Figure 6) (1.42 g, 61%) as a white solid; mp 118-120°C
(EtOAc/iso-octane) [lit., (Coles et al., J. Am. Chem. Soc., 60:1020-1022 (1938), which is
hereby incorporated by reference) mp 117.3°C (EtOH)]; [α]27D = -11.9 (c 1.65, CHCl3)
[lit., (Helferich et al., Just. Lieb. Ann. Chem., 541:1-16 (1939), which is hereby
incorporated by reference) [α]20D = -12.3 (c 0.2, CHCl3)]; 1H NMR (200 MHz, CDCl3) δ
2.00, 2.02, 2.07, 2.09 (s x 4, 3H x 4, Ac x 4), 3.42-3.51 (m, 2H), 3.67-3.87 (m, 2H),
4.10-4.31 (m, 3H), 4.57 (d, J1,2 8 Hz, 1H, H-1), 4.97-5.27 (m, 3H). NaSSO2CH3 (260 mg, 1.94
mmol) was added to a solution of 4h (See Figure 6) (640 mg, 1.41 mmol) in DMF (18
mL) under N2 and warmed to 50°C. After 25 hours, the solution was cooled and the
solvent removed. The residue was purified by flash chromatography (EtOAc : hexane,
1:1) and the resulting solid recrystallized from EtOAc/hexane to give 1h (See Figure 6)



WO 00/01712 



PCT/US99/15138 




(544 mg, 80%) as a white solid; mp 115-116°C (EtOAc/hexane); [α]27D = +5.4 (c 1.06,
CHCl3); IR (KBr) 1758, 1741 cm-1 (C=O), 1314, 1133 cm-1 (S-SO2); 1H NMR (500 MHz,
CDCl3, COSY) δ 1.99, 2.02, 2.06, 2.08 (s x 4, 3H x 4, Ac x 4), 3.30-3.38 (m, 2H, -CH2S-),
3.34 (s, 3H, CH3SO2-), 3.70 (ddd, J4,5 9.9 Hz, J5,6 2.2 Hz, J5,6' 4.6 Hz, 1H, H-5), 3.83
(ddd, J 5.6 Hz, J [illegible] Hz, J 10.5 Hz, 1H, -OCHH'-), 4.13-4.18 (m, 2H, H-6, -OCHH'-),
4.24 (dd, J5,6' 4.6 Hz, J6,6' 12.4 Hz, 1H, H-6'), 4.55 (d, J1,2 8.1 Hz, 1H, H-1), 4.98 (dd, J1,2
8.1 Hz, J2,3 9.7 Hz, 1H, H-2), 5.07 (t, J 9.9 Hz, 1H, H-4), 5.19 (t, J 9.6 Hz, 1H, H-3); 13C
NMR (125 MHz, CDCl3, DEPT) δ 20.5, 20.7 (q x 2, CH3COO- x 4), 36.0 (t, -CH2S-),
50.6 (q, CH3SO2-), 61.6 (t, -OCH2-), 68.1, 70.8, 71.9, 72.5 (d x 4, C-2, C-3, C-4, C-5),
68.4 (t, C-6), 100.8 (d, C-1), 169.3, 170.0, 170.5 (s x 3, CH3COO- x 4); HRMS m/z
(FAB+): Found 487.0940 (M+H+); C17H27O12S2 requires 487.0944.



Preparation of 2-(β-D-Glucopyranosyl)ethyl methanethiosulfonate (1c).

A solution of NaOMe (0.1 M, 0.3 mL) was added to a suspension of 4h
(See Figure 6) (300 mg, 0.66 mmol) in MeOH (3 mL) under N2 and stirred vigorously.
After 4 hours, the resulting solution was passed through a Dowex 50W(H+) plug (2 x 1
cm, eluant MeOH) and the solvent removed to give 2-bromoethyl β-D-glucopyranoside 4c
(See Figure 6) (176 mg, 93%) as a white solid that was used directly in the next step. A
sample was recrystallized from EtOH/EtOAc to give a colorless, crystalline solid; mp
74-78°C (EtOH/EtOAc) [lit., (Helferich et al., Just. Lieb. Ann. Chem., 541:1-16 (1939),
which is hereby incorporated by reference) mp 74-75°C (EtOH/EtOAc)]; [α]26D = -22.4
(c 1.63, H2O) [lit., (Helferich et al., Just. Lieb. Ann. Chem., 541:1-16 (1939), which is
hereby incorporated by reference) [α]D = -26.1 (c 3.0, H2O)]; 1H NMR (400 MHz,
CD3OD) δ 3.30 (t, J 8.4 Hz, 1H, H-2), 3.39-3.49 (m, 3H), 3.64-3.80 (m, 3H), 3.97 (br d,
J6,6' 11.7 Hz, 1H, H-6'), 4.02 (dt, Jt 6.5 Hz, Jd 11.3 Hz, 1H, -OCHH'-), 4.23 (dt, Jt 6.5 Hz,
Jd 11.3 Hz, 1H, -OCHH'-), 4.44 (d, J1,2 7.9 Hz, 1H, H-1). NaSSO2CH3 (100 mg, 0.75
mmol) was added to a solution of 4c (See Figure 6) (176 mg, 0.61 mmol) in DMF (7 mL)
under N2 and warmed to 50°C. After 15 hours, the solution was cooled and the solvent
removed. The residue was purified by flash chromatography (MeOH : EtOAc, 1:9) to
give 1c (See Figure 6) (144 mg, 74%) as a hygroscopic foam; [α]27D = -15.8 (c 0.88,
H2O); IR (KBr) 3400 cm-1 (OH), 1310, 1131 cm-1 (S-SO2); 1H NMR (500 MHz, D2O,
COSY) δ 3.07 (dd, J1,2 8.1 Hz, J2,3 9.4 Hz, 1H, H-2), 3.16 (dd, J3,4 9.0 Hz, J4,5 9.8 Hz, 1H,
H-4), 3.24 (ddd, J4,5 9.8 Hz, J5,6 6.0 Hz, J5,6' 2.3 Hz, 1H, H-5), 3.27 (t, J 9.0 Hz, 1H, H-3),
3.30-3.33 (m, 2H, -CH2S-), 3.34 (s, 3H, CH3SO2-), 3.50 (dd, J5,6 6.0 Hz, J6,6' 12.4 Hz, 1H,
H-6), 3.69 (dd, J5,6' 2.3 Hz, J6,6' 12.4 Hz, 1H, H-6'), 3.81 (dt, Jt 5.8 Hz, Jd 11.5 Hz, 1H,
-OCHH'-), 4.00 (dt, Jt 5.7 Hz, Jd 11.4 Hz, 1H, -OCHH'-), 4.30 (d, J1,2 8.1 Hz, 1H, H-1);
13C NMR (50 MHz, D2O) δ 36.9 (-CH2S-), 51.0 (CH3SO2-), 62.0 (-OCH2-), 69.5, 70.9,
74.3, 76.7, 77.3 (C-2, C-3, C-4, C-5, C-6), 103.7 (C-1); HRMS m/z (FAB+): Found
341.0351 (M+Na+); C9H18O8S2Na requires 341.0341.

Preparation of 2-(2,3,4,6-Tetra-O-acetyl-α-D-mannopyranosyl)ethyl
methanethiosulfonate (1i).

BF3·Et2O (7.7 mL, 60.7 mmol) was added dropwise over the course of 15
minutes to a solution of 1,2,3,4,6-penta-O-acetyl-α,β-D-mannose (5d) (See Figure 7) (4.7
g, 12.1 mmol) and Br(CH2)2OH (1.05 mL, 14.8 mmol) in CH2Cl2 (22 mL) at 0°C under
N2. After 1 hour, the solution was warmed to room temperature. After 25 hours, the
reaction solution was added to ice water (20 mL) and extracted with CH2Cl2 (20 mL x 2).
These extracts were combined, washed with water (20 mL), sat. NaHCO3 (aq., 20 mL),
water (20 mL), dried (MgSO4), filtered, and the solvent removed. The residue was
crystallized from EtOAc/iso-octane to give 2-bromoethyl 2,3,4,6-tetra-O-acetyl-α-D-mannopyranoside
(4i) (See Figure 7) (3.52 g, 64%). Purification of the resulting mother
liquor by flash chromatography (EtOAc : hexane, 1:3) gave further 4i (See Figure 7) (320
mg, 6%; 70% in total) as a white highly crystalline solid; mp 121-123°C [lit., (Dahmen et
al., "2-Bromoethyl Glycosides - Synthesis and Characterization," Carbohydr. Res.,
116:303-307 (1983), which is hereby incorporated by reference) 118-119°C
(EtOAc/iso-octane)]; [α]28D = +48.3 (c 1.31, CHCl3) [lit., (Dahmen et al., "2-Bromoethyl Glycosides
- Synthesis and Characterization," Carbohydr. Res., 116:303-307 (1983), which is hereby
incorporated by reference) [α]D = +45 (c 0.6, CDCl3)]; 1H NMR (200 MHz, CDCl3) δ
1.99, 2.05, 2.10, 2.16 (s x 4, 3H x 4, Ac x 4), 3.52 (t, J 6 Hz, 2H, -CH2Br), 3.82-4.04 (m,
2H, -OCH2-), 4.09-4.16 (m, 1H, H-5), 4.13 (dd, J5,6 2 Hz, J6,6' 12 Hz, 1H, H-6), 4.28 (dd,
J5,6' 6 Hz, J6,6' 12 Hz, 1H, H-6'), 4.87 (br s, 1H, H-1), 5.22-5.40 (m, 3H, H-2, H-3, H-4).
NaSSO2CH3 (230 mg, 1.72 mmol) was added to a solution of 4i (See Figure 7) (600 mg,
1.32 mmol) in DMF (17 mL) under N2 and warmed to 55°C. After 20 hours, the solution
was cooled and the solvent removed. The residue was purified by flash chromatography
(EtOAc : hexane, 9:11) and the resulting solid recrystallized from Et2O/hexane to give 1i
(See Figure 7) (566 mg, 88%) as a white solid; mp 128-129°C (Et2O/hexane); [α]27D =
+53.2 (c 0.92, CHCl3); IR (KBr) 1739 cm-1 (C=O), 1325, 1129 cm-1 (S-SO2); 1H NMR (500
MHz, CDCl3, COSY) δ 1.97, 2.04, 2.09, 2.14 (s x 4, 3H x 4, Ac x 4), 3.37-3.40 (m, 2H,
-CH2S-), 3.38 (s, 3H, CH3SO2-), 3.79 (dt, Jd 10.5 Hz, Jt 5.8 Hz, 1H, -OCHH'-), 3.98-4.03
(m, 2H, -OCHH'-, H-5), 4.09 (dd, J5,6 2.5 Hz, J6,6' 12.5 Hz, 1H, H-6), 4.26 (dd, J5,6' 5.6
Hz, J6,6' 12.5 Hz, 1H, H-6'), 4.85 (d, J1,2 0.7 Hz, 1H, H-1), 5.23-5.29 (m, 3H, H-2, H-3,
H-4); 13C NMR (50 MHz, CDCl3) δ 20.6, 20.7, 20.8 (CH3COO- x 4), 35.7 (-CH2S-), 50.8
(CH3SO2-), 62.5 (-OCH2-), 66.0, 66.8, 69.0, 69.2, 69.3 (C-2, C-3, C-4, C-5, C-6), 97.7
(C-1), 169.7, 169.9, 170.0, 170.6 (CH3COO- x 4); HRMS m/z (FAB+): Found 487.0954
(M+H+); C17H27O12S2 requires 487.0944.

Preparation of 2-(α-D-Mannopyranosyl)ethyl methanethiosulfonate (1d).

A solution of NaOMe (0.143 M, 0.7 mL) was added to a suspension of 4i
(See Figure 7) (1 g, 2.2 mmol) in MeOH (10 mL) under N₂. After 3 hours, the resulting
solution was passed through a Dowex 50W(H⁺) plug (2 x 1 cm, eluant MeOH) and the
solvent removed. The residue was purified by flash chromatography (MeOH : EtOAc,
2:25) to give 2-bromoethyl α-D-mannopyranoside 4d (See Figure 7) (the use of 4d as a
reactant has been described previously, although no details of preparation or
characterization were given (U.S. Patent 4,918,009 to Nilsson, which is hereby
incorporated by reference)) (606 mg, 96%) as a white foam; [α]D²⁶ = +50.7 (c 0.91, H₂O);
IR (KBr) 3417 cm⁻¹ (OH); ¹H NMR (500 MHz, D₂O, COSY) δ 3.38-3.44 (m, 3H, H-4,
-CH₂Br), 3.50-3.55 (m, 2H, H-5, H-6), 3.60 (dd, J2,3 3.5 Hz, J3,4 9.7 Hz, 1H, H-3), 3.66 (dd,
J5,6' 4.6 Hz, J6,6' 11.2 Hz, 1H, H-6'), 3.68 (ddd, J 4.6 Hz, J 5.4 Hz, J 11.7 Hz, 1H,
-OCHH'-), 3.76 (dd, J1,2 1.8 Hz, J2,3 3.5 Hz, 1H, H-2), 3.81 (ddd, J 5.4 Hz, J 6.5 Hz, J
11.7 Hz, 1H, -OCHH'-), 4.71 (d, J1,2 1.8 Hz, 1H, H-1); ¹³C NMR (100 MHz, D₂O) δ 32.1
(-CH₂Br), 61.7 (-OCH₂-), 67.5, 68.4, 70.7, 71.3, 73.8 (C-2, C-3, C-4, C-5, C-6), 100.5
(C-1); HRMS m/z (FAB+): Found 308.9985 (M+Na⁺); C₈H₁₅O₆BrNa requires 308.9950.
NaSSO₂CH₃ (150 mg, 1.12 mmol) was added to a solution of 4d (See Figure 7) (245 mg,
0.85 mmol) in DMF (10 mL) under N₂ and warmed to 50°C. After 16 hours, the solution
was cooled and the solvent removed. The residue was purified by flash chromatography
(MeOH : EtOAc, 1:9) to give 1d (See Figure 7) (217 mg, 80%) as a hygroscopic foam;
[α]D²⁹ = +58.0 (c 1.34, H₂O); IR (KBr) 3441 cm⁻¹ (OH), 1314, 1132 cm⁻¹ (S-SO₂); ¹H
NMR (500 MHz, D₂O) δ 3.31 (t, J 5.8 Hz, 2H, -CH₂S-), 3.35 (s, 3H, CH₃SO₂-), 3.45 (t, J
9.6 Hz, 1H, H-4), 3.49 (ddd, J4,5 9.8 Hz, J5,6 5.8 Hz, J5,6' 1.9 Hz, 1H, H-5), 3.55 (dd, J5,6 5.8
Hz, J6,6' 12.1 Hz, 1H, H-6), 3.60 (dd, J2,3 3.4 Hz, J3,4 9.0 Hz, 1H, H-3), 3.66 (dt, Jd 10.7 Hz,
Jt 5.7 Hz, 1H, -OCHH'-), 3.69 (dd, J5,6' 1.9 Hz, J6,6' 12.1 Hz, 1H, H-6'), 3.77 (dd, J1,2 1.6
Hz, J2,3 3.4 Hz, 1H, H-2), 3.83 (dt, Jd 11.0 Hz, Jt 5.9 Hz, 1H, -OCHH'-), 4.72 (d, J1,2 1.6
Hz, 1H, H-1); ¹³C NMR (125 MHz, D₂O) δ 36.7 (-CH₂S-), 50.7 (CH₃SO₂-), 61.9 (-OCH₂-),
66.7, 67.7, 70.9, 71.5, 74.0 (C-2, C-3, C-4, C-5, C-6), 100.8 (C-1); HRMS m/z (FAB+):
Found 319.0528 (M+H⁺); C₉H₁₉O₈S₂ requires 319.0521.
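The quantities quoted for the conversion of 4d into 1d can be cross-checked with a short calculation. The following is only an illustrative sketch: the molecular formulas are inferred from the compound names and the HRMS compositions above, the atomic masses are rounded standard values, and the helper names are hypothetical.

```python
# Average atomic masses (g/mol), rounded standard values (assumed here).
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "O": 15.999, "S": 32.06, "Br": 79.904}


def molar_mass(formula):
    """Molar mass (g/mol) of a composition given as {element: count}."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


def percent_yield(product_mg, product_formula, start_mg, start_formula):
    """Isolated yield (%) for a 1:1 conversion of starting material to product."""
    mmol_start = start_mg / molar_mass(start_formula)
    mmol_product = product_mg / molar_mass(product_formula)
    return 100.0 * mmol_product / mmol_start


# Neutral compositions inferred from the names given in the text.
BROMIDE_4D = {"C": 8, "H": 15, "Br": 1, "O": 6}  # 2-bromoethyl alpha-D-mannopyranoside
MTS_1D = {"C": 9, "H": 18, "O": 8, "S": 2}       # methanethiosulfonate 1d

mmol_4d = 245 / molar_mass(BROMIDE_4D)               # 245 mg of 4d
yield_1d = percent_yield(217, MTS_1D, 245, BROMIDE_4D)  # 217 mg of 1d isolated
print(f"4d: {mmol_4d:.2f} mmol, yield of 1d: {yield_1d:.0f}%")
```

Under these assumptions the calculation gives 0.85 mmol of 4d and a yield of about 80%, in agreement with the figures stated in the procedure.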



Preparation of 2-(2,3,4,6-Tetra-O-acetyl-β-D-galactopyranosyl)ethyl
methanethiosulfonate (1j).

BF₃·Et₂O (8.5 mL, 67.0 mmol) was added dropwise to a solution of
1,2,3,4,6-penta-O-acetyl-α,β-D-galactose (5e) (See Figure 7) (5.1 g, 13.1 mmol) and
Br(CH₂)₂OH (1.15 mL, 16.2 mmol) in CH₂Cl₂ (24 mL) at 0°C under N₂. After 1 hour, the
solution was warmed to room temperature. After 24 hours, the reaction solution was
added to ice water (20 mL) and extracted with CH₂Cl₂ (30 mL x 3). These extracts were
combined, washed with water (20 mL), sat. NaHCO₃ (aq., 20 mL), water (20 mL), dried
(MgSO₄), filtered, and the solvent removed. The residue was purified by flash
chromatography (EtOAc : hexane, 1:3) to give 2-bromoethyl 2,3,4,6-tetra-O-acetyl-β-D-
galactopyranoside (4j) (See Figure 7) (4.01 g, 67%) as a white solid; mp 116-117°C
(EtOAc/hexane) [lit., (Coles et al., J. Am. Chem. Soc., 60:1020-1022 (1938), which is
hereby incorporated by reference) 111°C; lit., (Dahmen et al., "2-Bromoethyl Glycosides
- Synthesis and Characterization," Carbohydr. Res., 116:303-307 (1983), which is hereby
incorporated by reference) 114-116°C (EtOAc/light pet. ether)]; [α]D²⁷ = -3.8 (c 0.81,
CHCl₃) [lit., (Dahmen et al., "2-Bromoethyl Glycosides - Synthesis and
Characterization," Carbohydr. Res., 116:303-307 (1983), which is hereby incorporated by
reference) [α]D²³ = -5 (c 1.4, CDCl₃)]; ¹H NMR (200 MHz, CDCl₃) δ 1.98, 2.05, 2.08,
2.15 (s x 4, 3H x 4, Ac x 4), 3.43-3.50 (m, 2H), 3.75-3.95 (m, 2H), 4.12-4.24 (m, 3H),
4.53 (d, J1,2 8 Hz, 1H, H-1), 5.02 (dd, J2,3 11 Hz, J3,4 3 Hz, 1H, H-3), 5.23 (dd, J1,2 8 Hz,
J2,3 11 Hz, 1H, H-2), 5.40 (br d, J3,4 3 Hz, 1H, H-4). NaSSO₂CH₃ (85 mg, 0.63 mmol) was
added to a solution of 4j (See Figure 7) (223 mg, 0.49 mmol) in DMF (6 mL) under N₂
and warmed to 55°C. After 30 hours, the solution was cooled and the solvent removed.
The residue was purified by flash chromatography (EtOAc : hexane, 1:1) to give 1j (See
Figure 7) (198 mg, 83%) as a white foam; [α]D = +9.1 (c 1.41, CHCl₃); IR (film) 1747
cm⁻¹ (C=O), 1320, 1133 cm⁻¹ (S-SO₂); ¹H NMR (500 MHz, CDCl₃) δ 1.98, 2.05, 2.09,
2.15 (s x 4, 3H x 4, Ac x 4), 3.35 (s, 3H, CH₃SO₂-), 3.35-3.38 (m, 2H, -CH₂S-), 3.84
(ddd, J 6.1 Hz, J 7.1 Hz, J 10.5 Hz, 1H, -OCHH'-), 3.92 (td, J4,5 1.1 Hz, Jt 6.6 Hz, 1H,
H-5), 4.10-4.21 (m, 3H, H-6, H-6', -OCHH'-), 4.52 (d, J1,2 8.0 Hz, 1H, H-1), 5.01 (dd,
J2,3 10.3 Hz, J3,4 3.5 Hz, 1H, H-3), 5.20 (dd, J1,2 8.0 Hz, J2,3 10.3 Hz, 1H, H-2), 5.40 (dd, J3,4
3.5 Hz, J4,5 1.1 Hz, 1H, H-4); ¹³C NMR (100 MHz, CDCl₃) δ 20.6, 20.7, 20.8 (CH₃COO-
x 4), 36.1 (-CH₂S-), 50.6 (CH₃SO₂-), 61.2 (-OCH₂-), 67.0, 68.3, 68.5, 70.8, 71.0 (C-2,
C-3, C-4, C-5, C-6), 101.3 (C-1), 169.5, 170.0, 170.1, 170.4 (CH₃COO- x 4); HRMS m/z
(FAB+): Found 487.0936 (M+H⁺); C₁₇H₂₇O₁₂S₂ requires 487.0944.

Preparation of 2-(β-D-Galactopyranosyl)ethyl methanethiosulfonate (1e).

A solution of NaOMe (0.104 M, 0.8 mL) was added to a solution of 4j
(See Figure 7) (778 mg, 1.71 mmol) in MeOH (10 mL) under N₂. After 4 hours, the
reaction solution was passed through a Dowex 50W(H⁺) plug (3 x 1 cm, eluant MeOH)
and the solvent removed to give 2-bromoethyl β-D-galactopyranoside (4e) (See Figure 7)
(450 mg, 92%) (the synthesis of unstable 4e has been described previously: Dahmen et
al., "2-Bromoethyl Glycosides. 4. 2-Bromoethyl Glycosides in Glycoside Synthesis -
Preparation of Glycoproteins Containing Alpha-L-Fuc-(1->2)-D-Gal and
Beta-D-Gal-(1->4)-D-GlcNAc," Carbohydr. Res., 125:237-245 (1984), which is hereby
incorporated by reference) as a white solid which was used directly in the next step.
NaSSO₂CH₃ (180 mg, 1.34 mmol) was added to a solution of 4e (See Figure 7) (290 mg,
1.01 mmol) in DMF (12 mL) under N₂ and warmed to 50°C. After 15 hours, the solution
was cooled and the solvent removed. The residue was purified by flash chromatography
(MeOH : EtOAc, 1:9) to give 1e (See Figure 7) (229 mg, 71%) as a white foam;
[α]D²⁷ = +2.9 (c 0.58, H₂O); IR (film) 3358 cm⁻¹ (br, O-H), 1306, 1120 cm⁻¹ (S-SO₂);
¹H NMR (500 MHz, D₂O, COSY) δ 3.29-3.33 (m, 2H, -CH₂S-), 3.30 (dd, J1,2 7.7 Hz,
J2,3 10.0 Hz, 1H, H-2), 3.35 (s, 3H, CH₃SO₂-), 3.43 (dd, J2,3 10.0 Hz, J3,4 3.6 Hz, 1H, H-3),
3.48 (ddd, J4,5 0.9 Hz, J5,6 4.3 Hz, J5,6' 7.9 Hz, 1H, H-5), 3.52 (dd, J5,6 4.3 Hz, J6,6' 11.7 Hz,
1H, H-6), 3.57 (dd, J5,6' 7.9 Hz, J6,6' 11.7 Hz, 1H, H-6'), 3.70 (dd, J3,4 3.6 Hz, J4,5 0.9 Hz,
1H, H-4), 3.80 (dt, Jd 11.2 Hz, Jt 6.1 Hz, 1H, -OCHH'-), 4.01 (dt, Jd 11.4 Hz, Jt 5.8 Hz,
1H, -OCHH'-), 4.24 (d, J1,2 7.7 Hz, 1H, H-1); ¹³C NMR (100 MHz, D₂O) δ 36.7 (-CH₂S-),
50.8 (CH₃SO₂-), 61.9 (-OCH₂-), 69.2, 69.6, 71.7, 73.7, 76.2 (C-2, C-3, C-4, C-5, C-6),
104.0 (C-1); HRMS m/z (FAB+): Found 319.0523 (M+H⁺); C₉H₁₉O₈S₂ requires 319.0521.
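The "requires" value in an HRMS entry can be reproduced from monoisotopic masses. The sketch below is illustrative only: the isotope masses are standard rounded values, the function names are hypothetical, and, as an observation rather than a statement about the original calculation, the quoted value is matched when the atomic masses are summed without subtracting the electron mass.

```python
# Monoisotopic masses (u) of the most abundant isotopes (rounded standard values).
MONOISOTOPIC = {
    "C": 12.000000,
    "H": 1.007825,
    "O": 15.994915,
    "S": 31.972071,
    "Na": 22.989770,
    "Br": 78.918338,
}


def exact_mass(formula):
    """Sum of monoisotopic masses for a composition given as {element: count}."""
    return sum(MONOISOTOPIC[element] * count for element, count in formula.items())


def ppm_error(found, required):
    """Deviation of a measured m/z from the required value, in parts per million."""
    return 1e6 * (found - required) / required


# Composition of the protonated ion of 1e as written in the text: C9H19O8S2.
ION_1E = {"C": 9, "H": 19, "O": 8, "S": 2}

required = exact_mass(ION_1E)
deviation = ppm_error(319.0523, required)
print(f"requires {required:.4f}, deviation {deviation:.1f} ppm")
```

With these inputs the computed value rounds to 319.0521, and the reported 319.0523 lies within 1 ppm of it.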

Preparation of 2-(2,3,6-Tri-O-acetyl-4-O-(2,3,4,6-tetra-O-acetyl-β-D-galactopyranosyl)-
β-D-glucopyranosyl)ethyl methanethiosulfonate (1k).

BF₃·Et₂O (4.0 mL, 31.5 mmol) was added dropwise to a solution of
1,2,3,6-tetra-O-acetyl-4-O-(2,3,4,6-tetra-O-acetyl-β-D-galactopyranosyl)-β-D-
glucopyranoside (5f) (See Figure 7) (5 g, 7.4 mmol) and Br(CH₂)₂OH (0.65 mL, 9.2
mmol) in CH₂Cl₂ (15 mL) at 0°C under N₂. After 1 hour, the solution was warmed to
room temperature. After 20 hours, the reaction solution was added to ice water (15 mL)
and extracted with CH₂Cl₂ (20 mL x 2). These extracts were combined, washed with
water (20 mL), sat. NaHCO₃ (aq., 20 mL), water (20 mL), dried (MgSO₄), filtered, and the
solvent removed. The residue was purified by flash chromatography (EtOAc : hexane,
1:1) to give 2-bromoethyl 2,3,6-tri-O-acetyl-4-O-(2,3,4,6-tetra-O-acetyl-β-D-
galactopyranosyl)-β-D-glucopyranoside (4k) (See Figure 7) (2.94 g, 53%) as a white
foam; [α]D = -7.8 (c 1.28, CHCl₃) [lit., (Dahmen et al., "2-Bromoethyl Glycosides -
Synthesis and Characterization," Carbohydr. Res., 116:303-307 (1983), which is hereby
incorporated by reference) [α]D²³ = -11 (c 1.3, CHCl₃)]; ¹H NMR (500 MHz, CDCl₃,
COSY) δ 1.94, 2.02, 2.02 (s x 3, 3H x 3, Ac x 3), 2.04 (s, 6H, Ac x 2), 2.10, 2.13 (s x 2,
3H x 2, Ac x 2), 3.38-3.46 (m, 2H, -CH₂Br), 3.59 (ddd, J5',6''' 2.2 Hz, J 4.9 Hz, J 9.9 Hz,
1H, H-5'), 3.75-3.80 (m, 2H, H-4', -OCHH'-), 3.85 (td, J4,5 1.1 Hz, Jt 6.9 Hz, 1H, H-5),
4.03-4.12 (m, 4H, H-6, H-6', H-6'', -OCHH'-), 4.45 (d, J1,2 7.8 Hz, 1H, H-1), 4.48 (dd,
J5',6''' 2.2 Hz, J6'',6''' 12.1 Hz, 1H, H-6'''), 4.50 (d, J1',2' 7.9 Hz, 1H, H-1'), 4.89 (dd, J1',2' 7.9
Hz, J2',3' 9.6 Hz, 1H, H-2'), 4.92 (dd, J2,3 10.5 Hz, J3,4 3.4 Hz, 1H, H-3), 5.08 (dd, J1,2 7.8
Hz, J2,3 10.5 Hz, 1H, H-2), 5.18 (t, J 9.6 Hz, 1H, H-3'), 5.32 (dd, J3,4 3.4 Hz, J4,5 1.1 Hz,
1H, H-4). NaSSO₂CH₃ (87 mg, 0.65 mmol) was added to a solution of 4k (See Figure 7)
(357 mg, 0.48 mmol) in DMF (6 mL) under N₂ and warmed to 50°C. After 22 hours, the
solution was cooled and the solvent removed. The residue was purified by flash
chromatography (EtOAc : hexane, 11:9) to give 1k (See Figure 7) (327 mg, 88%) as a
white foam; [α]D²⁷ = -3.7 (c 1.0, CHCl₃); IR (KBr) 1751 cm⁻¹ (C=O), 1323, 1134 cm⁻¹
(S-SO₂); ¹H NMR (500 MHz, CDCl₃, COSY) δ 1.94, 2.02, 2.02, 2.04, 2.04, 2.11, 2.13 (s x 7,
3H x 7, Ac x 7), 3.29-3.40 (m, 2H, -CH₂S-), 3.32 (s, 3H, CH₃SO₂-), 3.59 (ddd, J4',5' 9.9
Hz, J5',6'' 4.9 Hz, J5',6''' 2.2 Hz, 1H, H-5'), 3.77 (t, J 9.5 Hz, 1H, H-4'), 3.79-3.86 (m, 2H,
H-5, -OCHH'-), 4.03-4.13 (m, 4H, H-6, H-6', H-6'', -OCHH'-), 4.46 (d, J1,2 7.8 Hz, 1H,
H-1), 4.50 (d, J1',2' 8.0 Hz, 1H, H-1'), 4.52 (dd, J5',6''' 2.2 Hz, J6'',6''' 11.9 Hz, 1H, H-6'''),
4.87 (dd, J1',2' 8.0 Hz, J2',3' 9.6 Hz, 1H, H-2'), 4.93 (dd, J2,3 10.5 Hz, J3,4 3.5 Hz, 1H, H-3),
5.08 (dd, J1,2 7.8 Hz, J2,3 10.5 Hz, 1H, H-2), 5.17 (t, J 9.3 Hz, 1H, H-3'), 5.32 (dd, J3,4 3.5
Hz, J4,5 1.0 Hz, 1H, H-4); ¹³C NMR (125 MHz, CDCl₃, DEPT) δ 20.5, 20.7, 20.8, 20.9 (q
x 4, CH₃COO- x 7), 36.0 (t, -CH₂S-), 50.6 (q, CH₃SO₂-), 60.7, 61.6, 68.5 (t x 3, -OCH₂-,
C-6, C-6'), 66.5, 69.0, 70.6, 70.9, 71.3, 72.6, 72.8, 76.0 (d x 8, C-2, C-3, C-4, C-5, C-2',
C-3', C-4', C-5'), 100.7, 101.1 (d x 2, C-1, C-1'), 169.1, 169.7, 169.7, 170.1, 170.2,
170.3, 170.4 (s x 7, CH₃COO- x 7); HRMS m/z (FAB+): Found 775.1793 (M+H⁺);
C₂₉H₄₃O₂₀S₂ requires 775.1789.

Preparation of 2-(4-O-β-D-Galactopyranosyl-β-D-glucopyranosyl)ethyl
methanethiosulfonate (1f).

A solution of NaOMe (0.1 M, 0.6 mL) was added to a solution of 4k (See
Figure 7) (877 mg, 1.18 mmol) in MeOH (6 mL) under N₂. After 3 hours, the reaction
solution was passed through a Dowex 50W(H⁺) plug (4 x 1 cm, eluant MeOH) and the
solvent removed to give 2-bromoethyl 4-O-β-D-galactopyranosyl-β-D-glucopyranoside
(4f) (See Figure 7) (476 mg, 90%) as a white foam which was used directly in the next
step. NaSSO₂CH₃ (185 mg, 1.38 mmol) was added to a solution of 4f (See Figure 7) (476
mg, 1.06 mmol) in DMF (24 mL) under N₂ and warmed to 50°C. After 21 hours, the
solution was cooled and the solvent removed. The residue was purified by flash
chromatography (CHCl₃ : MeOH : AcOH : H₂O, 60:30:3:5) to give 1f (See Figure 7) (346
mg, 68%) as a hygroscopic foam; [α]D²⁸ = +1.5 (c 1.66, H₂O); IR (KBr) 3416 cm⁻¹ (br,
OH), 1311, 1131 cm⁻¹ (S-SO₂); ¹H NMR (400 MHz, D₂O, COSY) δ 3.10-3.13 (m, 1H,
H-2), 3.30 (t, J 6.0 Hz, 2H, -CH₂S-), 3.31 (dd, J1',2' 7.8 Hz, J2',3' 10.3 Hz, 1H, H-2'), 3.34 (s,
3H, CH₃SO₂-), 3.38-3.54 (m, 5H), 3.44 (dd, J2',3' 10.3 Hz, J3',4' 3.3 Hz, 1H, H-3'), 3.57
(dd, J 8.4 Hz, J 11.4 Hz, 1H), 3.59 (dd, J 4.9 Hz, J 7.3 Hz, 1H), 3.70 (br d, J3',4' 3.3 Hz,
1H, H-4'), 3.77 (dd, J 1.0 Hz, J 11.5 Hz, 1H), 3.79-3.83 (m, 1H, -OCHH'-), 3.97-4.02 (m,
1H, -OCHH'-), 4.22 (d, J1',2' 7.8 Hz, 1H, H-1'), 4.33 (d, J1,2 7.8 Hz, 1H, H-1); ¹³C NMR
(125 MHz, D₂O) δ 36.7 (-CH₂S-), 50.8 (CH₃SO₂-), 61.0, 62.1, 69.4, 69.6, 71.9, 73.5, 73.7,
75.3, 75.9, 76.4, 79.3 (-OCH₂-, C-2, C-3, C-4, C-5, C-6, C-2', C-3', C-4', C-5', C-6'),
103.3, 103.9 (C-1, C-1'); HRMS m/z (FAB+): Found 503.0886 (M+Na⁺); C₁₅H₂₈O₁₃S₂Na
requires 503.0869.

Example 2 - General Procedure for Modification of Subtilisin Bacillus lentus
("SBL") Mutants Stored as Flash-Frozen Solutions

A 1.25 mL frozen aliquot of the mutant enzyme (SBL-N62C, -L217C, or
-S166C) containing approximately 25 mg of enzyme was thawed and added to 1.25 mL of
Modifying Buffer (see below) in a polypropylene test-tube. To this solution was added
100 µL of a 0.2 M glyco-MTS reagent solution (1a,g-k in MeCN, 1b-f in water (See
Figure 8)). The mixture was sealed, vortexed, and placed on an end-over-end rotator at
room temperature. When the modification was complete (determined by a specific
activity assay, using succinyl-AlaAlaProPhe-p-nitroanilide [ε₄₁₀ = 8800 M⁻¹ cm⁻¹]
(Bonneau et al., "Alteration of the Specificity of Subtilisin BPN' by Site-Directed
Mutagenesis in its S1 and S1' Binding-Sites," J. Am. Chem. Soc., 119:1026-1030 (1991),
which is hereby incorporated by reference) as substrate in 0.1 M Tris-HCl buffer
containing 0.005% Tween 80, 1% DMSO, pH 8.6, showing constant activity, and titration
with Ellman's reagent (ε₄₁₂ = 13600 M⁻¹ cm⁻¹) (Ellman et al., Biochem. Pharmacol., 7:88-
95 (1961), which is hereby incorporated by reference) showing no free thiol present in
solution), a further 50 µL of the modifying reagent solution was added and the mixture
placed back on the end-over-end rotator for a further 10 minutes. The reaction was
poured onto a pre-packed, pre-equilibrated G-25 Sephadex® PD10 column and eluted with
3.5 mL Quench Buffer (see below). The eluant was dialysed at 4°C against 10 mM MES,
1 mM CaCl₂, pH 5.8 (2 x 1 L, 2 x 45 minutes). The resulting dialysate was flash frozen in
liquid nitrogen and stored at -18°C.
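The Ellman titration used above as a completion check rests on the Beer-Lambert relation, A = ε·c·l. A minimal sketch follows, using the ε₄₁₂ value quoted in the procedure; the absorbance reading, path length, and function name are hypothetical and serve only to illustrate the arithmetic.

```python
EPSILON_412 = 13600.0  # M^-1 cm^-1, extinction coefficient quoted in the procedure


def free_thiol_uM(absorbance_412, path_cm=1.0, dilution_factor=1.0):
    """Free thiol concentration (micromolar) from a blank-corrected A412 reading.

    Beer-Lambert: c = A / (epsilon * l), scaled by any dilution made for the assay.
    """
    molar = absorbance_412 / (EPSILON_412 * path_cm)
    return molar * dilution_factor * 1e6


# Hypothetical reading, chosen only to show the scale of the numbers.
example_uM = free_thiol_uM(0.136)
print(f"{example_uM:.1f} uM free thiol")
```

A reading of 0.136 in a 1 cm cell would correspond to about 10 µM thiol, so a fully modified sample is expected to give an absorbance indistinguishable from the blank.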



Modifying Buffer:

pH 9.5: 140 mM CHES, 2 mM CaCl₂
pH 7.5: 140 mM HEPES, 2 mM CaCl₂
pH 6.5: 140 mM MES, 2 mM CaCl₂
pH 5.5: 140 mM MES, 2 mM CaCl₂

Quench Buffer:

Reactions at pH 7.5 - 9.5: 5 mM MES, 1 mM CaCl₂, pH 6.5
Reactions at pH 5.5: 5 mM MES, 1 mM CaCl₂, pH 5.5
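The volumes in the general procedure imply a final buffer strength and a reagent excess that can be estimated as follows. This is a rough sketch: the enzyme molecular weight of about 26,700 g/mol is an assumption (chosen to be close to the ES-MS masses reported in this example), and the variable names are illustrative.

```python
def diluted_mM(stock_mM, stock_mL, total_mL):
    """Concentration after diluting stock_mL of stock into a total volume total_mL."""
    return stock_mM * stock_mL / total_mL


# 1.25 mL enzyme aliquot + 1.25 mL of 140 mM Modifying Buffer.
buffer_final_mM = diluted_mM(140.0, 1.25, 2.5)

# 100 uL of 0.2 M glyco-MTS reagent: mol/L x uL = umol.
reagent_umol = 0.2 * 100.0

# About 25 mg of enzyme; the molecular weight is an ASSUMED approximate value.
ASSUMED_ENZYME_MW = 26700.0  # g/mol
enzyme_umol = 25.0 / ASSUMED_ENZYME_MW * 1000.0  # mg / (g/mol) = mmol, then to umol

molar_excess = reagent_umol / enzyme_umol
print(f"buffer {buffer_final_mM:.0f} mM, reagent excess about {molar_excess:.0f}-fold")
```

Under these assumptions the buffer is about 70 mM in the reaction mixture, and the first reagent addition corresponds to roughly a 20-fold molar excess over enzyme.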

The free thiol content of all chemically modified mutant enzymes ("CMMs") was
determined spectrophotometrically by titration with Ellman's reagent (Ellman et al.,
Biochem. Pharmacol., 7:88-95 (1961), which is hereby incorporated by reference) in
phosphate buffer 0.25 M, pH 8.0. In all cases, no free thiol was detected. Modified
enzymes were analyzed by nondenaturing gradient (8-25%) gels at pH 4.2, run towards
the cathode, on the Pharmacia Phast-system and appeared as a single band. Prior to ES-
MS analysis, CMMs were purified by FPLC (BioRad, Biologic System, Hercules, CA) on
a Source 15 RPC matrix (17-0727-20 from Pharmacia, Bridgewater, NJ) with 5%
acetonitrile, 0.01% TFA as the running buffer and eluted with 80% acetonitrile, 0.01%
TFA in a one step gradient. MS m/z (ES-MS): N62C-S-a (See Figure 8) calculated
27049, found 27051; N62C-S-b (See Figure 8) calculated 26925, found 26928; N62C-S-c
(See Figure 8) calculated 26925, found 26928; N62C-S-d (See Figure 8) calculated
26925, found 26925; N62C-S-e (See Figure 8) calculated 26925, found 26925; N62C-S-f
(See Figure 8) calculated 27087, found 27087; N62C-S-g (See Figure 8) calculated
27093, found 27096; N62C-S-Et-β-Glc(Ac)₂ calculated 27009, found 27015; N62C-S-Et-
β-Glc(Ac)₃ calculated 27051, found 27053; N62C-S-i (See Figure 8) calculated 27093,
found 27098; N62C-S-Et-β-Gal(Ac)₃ calculated 27051, found 27051; N62C-S-k (See
Figure 8) calculated 27381, found 27386; L217C-S-β-Glc calculated 26882, found 26879;
L217C-S-β-Glc(Ac)₂ calculated 26966, found 26962; L217C-S-β-Glc(Ac)₃ calculated
27008, found 27006; L217C-S-b (See Figure 8) calculated 26926, found 26928;
L217C-S-c (See Figure 8) calculated 26926, found 26925; L217C-S-d (See Figure 8) calculated
26926, found 26925; L217C-S-e (See Figure 8) calculated 26926, found 26928;
L217C-S-f (See Figure 8) calculated 27088, found 27087; L217C-S-Et-α-Glc(Ac)₂ calculated
27010, found 27012; L217C-S-Et-β-Glc(Ac)₃ calculated 27052, found 27056;
L217C-S-Et-α-Man(Ac)₃ calculated 27052, found 27056; L217C-S-Et-β-Gal(Ac)₃ calculated
27052, found 27053; L217C-S-Et-Lac(Ac)₆ calculated 27340, found 27342; S166C-S-a
(See Figure 8) calculated 27076, found 27080; S166C-S-b (See Figure 8) calculated
26952, found 26955; S166C-S-c (See Figure 8) calculated 26952, found 26950;
S166C-S-d (See Figure 8) calculated 26952, found 26952; S166C-S-e (See Figure 8) calculated
26952, found 26952; S166C-S-f (See Figure 8) calculated 27114, found 27112;
S166C-S-Et-α-Glc(Ac)₃ calculated 27078, found 27078; S166C-S-Et-β-Glc(Ac)₂ calculated 27036,
found 27040; S166C-S-Et-β-Glc(Ac)₃ (major) with S166C-S-h (See Figure 8) (minor)
and S166C-S-Et-β-Glc(Ac)₂ (minor) calculated 27078 (major), 27120 (minor), 27036
(minor), found 27081 (major), 27121 (minor), 27036 (minor); S166C-S-Et-α-Man(Ac)₃
calculated 27078, found 27085; S166C-S-Et-β-Gal(Ac)₃ calculated 27078, found 27079;
S166C-S-Et-Lac(Ac)₅ calculated 27324, found 27331.
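The agreement between calculated and found ES-MS masses can be screened programmatically. The sketch below takes a handful of the pairs listed above and checks them against a ±7 Da window, which is the tolerance quoted for the ES-MS analyses in Example 9; the dictionary layout and function name are illustrative choices, not part of the original procedure.

```python
TOLERANCE_DA = 7  # ES-MS tolerance quoted in Example 9

# (calculated, found) pairs copied from the list above; a subset for illustration.
ES_MS = {
    "N62C-S-a": (27049, 27051),
    "N62C-S-Et-beta-Glc(Ac)2": (27009, 27015),
    "L217C-S-beta-Glc": (26882, 26879),
    "S166C-S-Et-alpha-Man(Ac)3": (27078, 27085),
    "S166C-S-Et-Lac(Ac)5": (27324, 27331),
}


def deviations(pairs):
    """Signed deviation (found - calculated) in Da for each named species."""
    return {name: found - calc for name, (calc, found) in pairs.items()}


devs = deviations(ES_MS)
outside = [name for name, d in devs.items() if abs(d) > TOLERANCE_DA]
largest = max(abs(d) for d in devs.values())
print(f"largest deviation {largest} Da; outside tolerance: {outside}")
```

For the pairs shown, the largest deviation is 7 Da and none falls outside the stated window.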

Example 3 - General procedure for modification of SBL mutants stored as
lyophilized powders

This procedure was only used with S156C, which is stored as a lyophilized
powder to prevent dimerization. Into a polypropylene test tube was weighed about 25 -
30 mg of lyophilized S156C. This was dissolved in the following modifying buffers (2.5
mL):

pH 9.5: 70 mM CHES, 2 mM CaCl₂
pH 7.5: 70 mM HEPES, 2 mM CaCl₂
pH 6.5: 70 mM MES, 2 mM CaCl₂
pH 5.5: 70 mM MES, 2 mM CaCl₂

Glyco-MTS reagent was added and the reaction then proceeded as for the other mutants,
using the appropriate quench buffer. MS m/z (ES-MS): S156C-S-a (See Figure 8)
calculated 27076, found 27079; S156C-S-b (See Figure 8) calculated 26952, found
26955; S156C-S-c (See Figure 8) calculated 26952, found 26952; S156C-S-d (See Figure
8) calculated 26952, found 26952; S156C-S-e (See Figure 8) calculated 26952, found
26952; S156C-S-f (See Figure 8) calculated 27114, found 27115; S156C-S-g (See Figure
8) calculated 27120, found 27123; S156C-S-h (See Figure 8) calculated 27120, found
27122; S156C-S-i (See Figure 8) calculated 27120, found 27123; S156C-S-j (See Figure
8) calculated 27120, found 27120; S156C-S-k (See Figure 8) calculated 27408, found
27411.

Example 4 - Contents of Acetylated Glyco-CMM Libraries

The levels of acetylation of glyco-CMMs after modification of SBL
cysteine mutants with 1a, g-k (See Figure 8) at various pH levels were determined and are
set forth in Tables 1 and 2, below.

Table 1. Levels of Acetylation of Glyco-CMMs after Modification of SBL Cysteine
Mutants with 1a at various pH a

Enzyme    pH 9.5    pH 7.5    pH 5.5
N62C      4         -         4
S156C     4         -         4
S166C     4         -         4
L217C     0         2 b       3 b

a Isolated as single species unless indicated. b Single product mass by ES-MS.




Table 2. Levels of Acetylation of Glyco-CMMs after Modification of SBL Cysteine
Mutants with 1a,g-k at pH 5.5 a

Enzyme \ Reagent    1a     1g     1h               1i     1j     1k
N62C                4      4      3 b              4      3 b    7
S156C               4      4      4                4      4      7
S166C               4      3 b    3 c, 4 d, 2 d    3 b    3 b    5 b
L217C               3 b    2 b    3 b              3 b    3 b    6 b

a Isolated as single species unless indicated.
b Single product mass by ES-MS.
c Major component.
d Minor component.

Example 5 - Incubation of L217C-S-β-Glc(Ac)₃ at pH 9.5

The general procedure for modification of SBL mutants stored as flash-
frozen solutions was used to incubate 1.26 mg of L217C-S-β-Glc(Ac)₃ as a 0.5 mL
aliquot in the absence of MTS reagent for 2 hours to give L217C-S-β-Glc as the sole
product. MS m/z (ES-MS): L217C-S-β-Glc calculated 26882, found 26885.
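Because each acetate adds a fixed mass increment, the number of acetyl groups retained on a glyco-CMM can be estimated from its ES-MS mass. The following sketch assumes an average increment of 42.04 Da per acetyl group (C₂H₂O, replacing H by COCH₃) and uses the calculated mass of fully deacetylated L217C-S-β-Glc as the reference; the function name is hypothetical.

```python
ACETYL_INCREMENT_DA = 42.04    # average mass of C2H2O (assumed value for this sketch)
L217C_S_GLC_CALC = 26882       # calculated mass of fully deacetylated L217C-S-beta-Glc


def acetyl_groups(found_mass, deacetylated_mass=L217C_S_GLC_CALC):
    """Nearest whole number of acetyl groups implied by a measured protein mass."""
    return round((found_mass - deacetylated_mass) / ACETYL_INCREMENT_DA)


# Found masses reported for L217C modified with 1a, and after incubation at pH 9.5.
for label, mass in [("Glc", 26879), ("Glc(Ac)2", 26962),
                    ("Glc(Ac)3", 27006), ("after incubation", 26885)]:
    print(label, acetyl_groups(mass))
```

With the found masses reported in Examples 2 and 5, this assigns zero, two, three, and zero acetates respectively, which matches the product assignments given in the text.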

Example 6 - Active Site Titrations

The active enzyme concentration was determined as previously described
(Hsia et al., "Active-Site Titration of Serine Proteases Using a Fluoride-Ion Selective
Electrode and Sulfonyl Fluoride Inhibitors," Anal. Biochem., 242:221-227 (1996), which
is hereby incorporated by reference) by monitoring fluoride release upon enzyme reaction
with α-toluenesulfonyl fluoride (PMSF) as measured by a fluoride ion sensitive electrode
(Orion Research 96-09). The active enzyme concentration determined in this way was
used to calculate kcat values for each CMM.

Example 7 - Kinetic Measurements

Michaelis-Menten constants were measured at 25 (± 0.2)°C by curve
fitting (GraFit 3.03, Erithacus Software Ltd., Staines, Middlesex, UK) of the initial rate
data determined at nine concentrations (0.125 mM-3.0 mM) of succinyl-AAPF-pNA
substrate in 0.1 M Tris-HCl buffer containing 0.005% Tween 80, 1% dimethylsulfoxide
("DMSO"), pH 8.6 (ε₄₁₀ = 8800 M⁻¹ cm⁻¹) (Bonneau et al., J. Am. Chem. Soc., 119:1026-
1030 (1991), which is hereby incorporated by reference).
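How KM and Vmax are extracted from initial-rate data can be illustrated with a simple linearization. Note that this is not the nonlinear curve fitting performed with GraFit; it is a Hanes-Woolf fit ([S]/v plotted against [S]) substituted here because it needs only the standard library. The nine substrate concentrations and the synthetic, noise-free rates are hypothetical values chosen within the stated 0.125-3.0 mM range.

```python
def michaelis_menten(s, vmax, km):
    """Initial rate predicted by the Michaelis-Menten equation."""
    return vmax * s / (km + s)


def hanes_woolf_fit(substrate, rates):
    """Estimate (Vmax, KM) by linear regression of [S]/v on [S].

    Hanes-Woolf form: [S]/v = [S]/Vmax + KM/Vmax, so slope = 1/Vmax and
    intercept = KM/Vmax.
    """
    y = [s / v for s, v in zip(substrate, rates)]
    n = len(substrate)
    mean_x = sum(substrate) / n
    mean_y = sum(y) / n
    sxx = sum((x - mean_x) ** 2 for x in substrate)
    sxy = sum((x - mean_x) * (yi - mean_y) for x, yi in zip(substrate, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / slope
    return vmax, intercept * vmax


# Hypothetical concentrations (mM) and synthetic rates (arbitrary units).
S_MM = [0.125, 0.25, 0.5, 0.75, 1.0, 1.5, 2.0, 2.5, 3.0]
RATES = [michaelis_menten(s, vmax=1.0, km=0.73) for s in S_MM]

vmax_fit, km_fit = hanes_woolf_fit(S_MM, RATES)
print(f"Vmax = {vmax_fit:.3f}, KM = {km_fit:.3f} mM")
```

On noise-free data the linearization returns the input parameters; with real data a weighted nonlinear fit is generally preferred because the transformation distorts the error structure.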

Example 8 - Controlled Site Selective Glycosylation of Proteins by a Combined Site-
Directed Mutagenesis and Chemical Modification Approach

Four SBL sites at different locations and of different characteristics were
selected for mutation to cysteine in order to provide a broad test of the glycosylation
methodology. S156 of the S₁-pocket (nomenclature of Schechter and Berger, Biochem.
Biophys. Res. Commun., 27:157-162 (1967), which is hereby incorporated by reference)
is a surface-exposed residue that permits the introduction of externally-disposed glycans
mirroring those found naturally in glycoproteins (Molecular Glycobiology, Fukuda et al.,
Eds., Oxford University, Oxford (1994), which is hereby incorporated by reference). In
contrast, N62 in the S₂ pocket, S166 in the S₁ pocket, and L217 in the S₁' pocket have side
chains which are internally oriented and test the applicability of the method for
introducing sugars at hindered locations. Broad applicability with respect to the sugar
moiety was evaluated by using the representative series of protected and deprotected,
mono- and disaccharide methanethiosulfonates ("MTS") 1a-k (see Figure 8). These were
prepared from their parent carbohydrates in good to excellent yields (Figures 6 (Reagents
and Conditions: (i) Ac₂O, py then HBr, AcOH; (ii) NaSSO₂CH₃, EtOH, 90°C;
(iii) Br(CH₂)₂OH, BF₃·Et₂O then Ac₂O, py; (iv) NaSSO₂CH₃, DMF, 50°C; (v) NaOMe,
MeOH; (vi) Ac₂O, py then Br(CH₂)₂OH, BF₃·Et₂O, DCM) and 7 (Reagents and
Conditions: (i) Ac₂O, py, 92% for 5d, 99% for 5e; Ac₂O, NaOAc, 82% for 5f;
(ii) Br(CH₂)₂OH, BF₃·Et₂O, DCM, 70% for 4i, 67% for 4j, 53% for 4k; (iii) NaOMe,
MeOH, 96% for 4d, 92% for 4e, 90% for 4f; (iv) NaSSO₂CH₃, DMF, 50°C, 80% for 1d,
71% for 1e, 68% for 1f, 88% for 1i, 83% for 1j, 88% for 1k)). Two types of
glycosylating reagents, the anomeric methanethiosulfonate 1a and the ethyl-tethered
methanethiosulfonates 1b,c,g,h, were prepared from D-glucose (2a, Figure 6). The
preparation of these reagents in fully protected 1a,g,h and deprotected 1b,c forms allowed
the effects of increased steric bulk and hydrophobicity to be assessed. Untethered MTS
reagent 1a was readily prepared from acetobromoglucose (3) using NaSSO₂CH₃ as shown
in Figure 6 (prepared from D-glucose according to Scheurer et al., J. Am. Chem. Soc.,
76:3224 (1954), which is hereby incorporated by reference). For the preparation of 1b,g,
an α-linked ethyl tether was introduced using Fischer glycosidation of D-glucose (2) with
2-bromoethanol. Treatment of the tetraacetylbromide 4g with NaSSO₂CH₃ allowed the
preparation of the peracetylated α-gluco-MTS 1g in an excellent 90% yield. Zemplen
deacylation (Zemplen et al., Dtsch. Chem. Ges., 56:1705-1710 (1923), which is
hereby incorporated by reference) of bromide 4g and subsequent displacement of bromide
by methanethiosulfonate ion proceeded smoothly to yield the fully deprotected α-gluco-
MTS 1b in 69% yield. The β-D-gluco-MTS reagents 1c and 1h, which are epimeric at C-1
relative to 1b and 1g, respectively, were prepared from the corresponding peracetylated
β-bromide 4h. The preparation of 4h took advantage of well-defined methodology
utilizing Lewis acid catalyzed displacement of anomeric acetates by alcohols (Dahmen et
al., "2-Bromoethyl Glycosides - Synthesis and Characterization," Carbohydr. Res.,
116:303-307 (1983), which is hereby incorporated by reference). The protected bromide
4h was elaborated to the corresponding peracetylated (1h) and deprotected (1c) β-gluco-
MTS reagents in an essentially identical manner to that used for the epimeric α-gluco-
MTS reagents. Thus, using NaSSO₂CH₃, 4h gave 1h in 78% yield and, following
deprotection, 4c (Helferich et al., Justus Liebigs Ann. Chem., 541:1-16 (1939), which is
hereby incorporated by reference) afforded 1c in 68% yield from 4h. Parallel routes
allowed similarly efficient access to the α-D-manno-MTS reagents 1d and 1i, which are
epimeric at C-2 relative to 1b and 1g, respectively, and the β-D-galacto-MTS reagents 1e
and 1j, epimeric at C-4 relative to 1c and 1h, respectively (Figure 7). The ready
adaptability of this method to oligosaccharides was illustrated by the preparation of the
peracetylated (1k) and fully deprotected (1f) disaccharide lacto-MTS reagents, in good
overall yields from lactose (2f) of 38% and 27% respectively without cleavage of the
interresidue bond.
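The overall yields quoted for the lactose-derived reagents follow from multiplying the individual step yields listed for Figure 7. A brief sketch of that arithmetic is given below; the variable names are illustrative, and `math.prod` requires Python 3.8 or later.

```python
from math import prod

# Step yields (as fractions) quoted for the lactose series in Figure 7.
STEPS_1K = [0.82, 0.53, 0.88]        # 2f -> 5f -> 4k -> 1k
STEPS_1F = [0.82, 0.53, 0.90, 0.68]  # 2f -> 5f -> 4k -> 4f -> 1f


def overall_yield_percent(step_yields):
    """Overall yield (%) of a linear sequence from its fractional step yields."""
    return 100.0 * prod(step_yields)


overall_1k = overall_yield_percent(STEPS_1K)
overall_1f = overall_yield_percent(STEPS_1F)
print(f"1k: {overall_1k:.0f}%, 1f: {overall_1f:.0f}%")
```

The products come to about 38% for 1k and 27% for 1f, consistent with the overall yields stated above.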
Example 9 - Site Specific Glycosylation.

The glyco-MTS reagents 1a-k (See Figure 8) were reacted with the chosen
cysteine mutants SBL-N62C, -S156C, -S166C and -L217C in aqueous buffer under
conditions described previously (Stabile et al., Bioorg. Med. Chem. Lett., 6:2501-2512
(1996); Berglund et al., J. Am. Chem. Soc., 119:5265-5266 (1997); DeSantis et al.,
Biochem., 37:5968-5973 (1998), which are hereby incorporated by reference). These
reactions were rapid and quantitative, as judged by monitoring of changes in specific
activity and by titration of free thiols with Ellman's reagent (Ellman et al., Biochem.
Pharmacol., 7:88-95 (1961), which is hereby incorporated by reference). The
glycosylated chemically modified mutants (CMMs) were purified by size-exclusion
chromatography and dialysis, and their structures were confirmed by rigorous ES-MS
analyses (± 7 Da) as shown in Table 3 below:

Table 3. Properties and Kinetic Parameters a of Modified Enzymes

Entry  Reactant  Pocket  MTS      Reacn.  Product(s)                  kcat          KM            kcat/KM
       Enzyme            Reagent  pH                                  (s⁻¹)         (mM)          (s⁻¹ mM⁻¹)

1      SBL-WT    -       -        -       -                           153 ± 4       0.73 ± 0.05   209 ± 15

2      N62C      S₂      -        -       -                           174 ± 9       1.90 ± 0.20   92 ± 11
3                        1a       9.5     N62C-S-a b                  67.9 ± 3.5    0.52 ± 0.07   130.6 ± 18.8
4                        1b       6.5     N62C-S-b b                  135.3 ± 3.5   0.94 ± 0.05   143.9 ± 8.5
5                        1c       6.5     N62C-S-c b                  132.7 ± 4.0   1.25 ± 0.08   106.2 ± 7.5
6                        1d       6.5     N62C-S-d b                  132.9 ± 3.1   1.04 ± 0.05   127.8 ± 6.8
7                        1e       6.5     N62C-S-e b                  119.3 ± 3.6   0.99 ± 0.07   120.5 ± 9.3
8                        1f       6.5     N62C-S-f b                  129.8 ± 2.4   1.04 ± 0.04   124.8 ± 5.3
9                        1g       5.5     N62C-S-g b                  120.0 ± 2.7   0.52 ± 0.03   230.8 ± 14.3
10                       1h       6.5     N62C-S-Et-β-Glc(Ac)₂ c      87.7 ± 4.2    1.63 ± 0.15   53.8 ± 5.8
11                       1h       5.5     N62C-S-Et-β-Glc(Ac)₃ c      100.3 ± 3.5   1.86 ± 0.12   53.9 ± 4.0
12                       1i       5.5     N62C-S-i b                  123.0 ± 1.6   1.05 ± 0.03   117.1 ± 3.7
13                       1j       5.5     N62C-S-Et-β-Gal(Ac)₃ c      103.4 ± 4.3   2.36 ± 0.17   43.8 ± 3.6
14                       1k       5.5     N62C-S-k b                  64.9 ± 1.5    0.88 ± 0.05   73.8 ± 4.5

15     L217C     S₁'     -        -       -                           41 ± 1        0.80 ± 0.04   51 ± 3
16                       1a       9.5     L217C-S-β-Glc b             27.7 ± 0.4    0.79 ± 0.03   35.1 ± 1.4
17                       1a       7.5     L217C-S-β-Glc(Ac)₂ c        44.9 ± 2.0    0.44 ± 0.06   102.0 ± 14.6
18                       1a       5.5     L217C-S-β-Glc(Ac)₃ c        36.3 ± 0.8    0.36 ± 0.03   100.8 ± 8.7
19                       1b       6.5     L217C-S-b b                 57.8 ± 0.6    0.67 ± 0.02   86.3 ± 2.7
20                       1c       6.5     L217C-S-c b                 50.6 ± 0.9    0.67 ± 0.03   75.5 ± 3.6
21                       1d       6.5     L217C-S-d b                 62.0 ± 1.3    0.55 ± 0.03   112.7 ± 6.6
22                       1e       6.5     L217C-S-e b                 46.2 ± 0.8    0.63 ± 0.08   73.3 ± 3.7
23                       1f       6.5     L217C-S-f b                 30.4 ± 0.6    0.46 ± 0.03   66.1 ± 4.5
24                       1g       5.5     L217C-S-Et-α-Glc(Ac)₂ c     72.7 ± 3.1    0.73 ± 0.08   99.6 ± 11.8
25                       1h       5.5     L217C-S-Et-β-Glc(Ac)₃ c     29.4 ± 0.8    0.93 ± 0.06   31.6 ± 2.2
26                       1i       5.5     L217C-S-Et-α-Man(Ac)₃ c     97.8 ± 2.4    0.59 ± 0.04   165.8 ± 12.0
27                       1j       5.5     L217C-S-Et-β-Gal(Ac)₃ c     39.2 ± 0.8    1.17 ± 0.05   33.5 ± 1.6
28                       1k       5.5     L217C-S-Et-Lac(Ac)₆ c       27.1 ± 0.6    0.69 ± 0.04   39.3 ± 2.4

29     S156C     S₁      -        -       -                           125 ± 4       0.85 ± 0.06   147 ± 11
30                       1a       9.5     S156C-S-a b                 54.8 ± 1.3    0.70 ± 0.04   78.3 ± 4.8
31                       1b       6.5     S156C-S-b b                 77.0 ± 1.2    0.84 ± 0.03   91.7 ± 3.6
32                       1c       6.5     S156C-S-c b                 76.6 ± 1.7    0.73 ± 0.04   104.9 ± 6.2
33                       1d       6.5     S156C-S-d b                 88.6 ± 2.8    0.79 ± 0.06   112.2 ± 9.2
34                       1e       6.5     S156C-S-e b                 78.9 ± 1.9    0.89 ± 0.04   89.7 ± 4.4
35                       1f       6.5     S156C-S-f b                 63.6 ± 1.4    0.89 ± 0.05   71.8 ± 4.3
36                       1g       5.5     S156C-S-g b                 43.6 ± 0.8    0.78 ± 0.04   55.9 ± 3.0
37                       1h       5.5     S156C-S-h b                 64.0 ± 1.3    0.72 ± 0.04   88.9 ± 5.2
38                       1i       5.5     S156C-S-i b                 60.3 ± 0.9    0.71 ± 0.03   84.9 ± 3.8
39                       1j       5.5     S156C-S-j b                 51.9 ± 0.6    0.61 ± 0.02   85.1 ± 3.0
40                       1k       5.5     S156C-S-k b                 53.6 ± 0.8    0.79 ± 0.03   67.4 ± 2.8

41     S166C     S₁      -        -       -                           42 ± 1        0.50 ± 0.05   84 ± 9
42                       1a       9.5     S166C-S-a b                 33.8 ± 1.3    0.66 ± 0.06   51.2 ± 5.0
43                       1b       6.5     S166C-S-b b                 81.9 ± 1.1    1.14 ± 0.03   71.8 ± 2.1
44                       1c       6.5     S166C-S-c b                 67.0 ± 2.2    0.99 ± 0.07   67.6 ± 5.3
45                       1d       6.5     S166C-S-d b                 76.5 ± 2.0    1.17 ± 0.07   65.4 ± 4.3
46                       1e       6.5     S166C-S-e b                 62.2 ± 1.4    1.08 ± 0.05   57.6 ± 3.0
47                       1f       6.5     S166C-S-f b                 58.2 ± 1.2    1.02 ± 0.04   57.1 ± 2.5
48                       1g       5.5     S166C-S-Et-α-Glc(Ac)₃ c     31.0 ± 0.8    0.77 ± 0.05   40.3 ± 2.8
49                       1h       6.5     S166C-S-Et-β-Glc(Ac)₂ c     95.0 ± 2.1    0.87 ± 0.05   109.2 ± 6.7
50                       1h       5.5     S166C-S-Et-β-Glc(Ac)₃ d     72.9 ± 1.7    0.65 ± 0.04   112.2 ± 7.4
                                          S166C-S-Et-β-Glc(Ac)₂ e
                                          S166C-S-h e
51                       1i       5.5     S166C-S-Et-α-Man(Ac)₃ c     67.7 ± 1.9    1.64 ± 0.09   41.3 ± 2.5
52                       1j       5.5     S166C-S-Et-β-Gal(Ac)₃ c     65.1 ± 0.9    0.80 ± 0.03   81.3 ± 3.3
53                       1k       5.5     S166C-S-Et-Lac(Ac)₅ c       67.4 ± 1.6    1.65 ± 0.07   40.8 ± 2.0

a Michaelis-Menten constants were measured at 25 °C according to the initial rates method in
0.1 M Tris-HCl buffer at pH 8.6, 0.005% Tween 80, 1% DMSO, suc-AAPF-pNA as the substrate.
b Single species.
c Single product mass by ES-MS.
d Major component.
e Minor component.

The CMMs each appeared as a single band on non-denaturing gradient PAGE, thereby
establishing their high purities. The active enzyme concentration of the resulting CMM
solutions was determined by active site titration with α-toluenesulfonyl fluoride (PMSF)
using a fluoride ion-sensitive electrode (Hsia et al., "Active-Site Titration of Serine
Proteases Using a Fluoride-Ion Selective Electrode and Sulfonyl Fluoride Inhibitors,"
Anal. Biochem., 242:221-227 (1996), which is hereby incorporated by reference). In all
cases, modification with the fully deprotected reagents 1b-f (See Figure 8) led to site-
specific glycosylations and the formation of single glycoforms. Furthermore,
modification with the protected MTS reagents 1a, g-k (See Figure 8) gave products with
controllable levels of acetylation. Through adjustment of pH and appropriate selection of
the glycosylation site, differently acetylated glycoforms of SBL were prepared. This
ability to modulate the level of acetylation through pH-control vastly expands the
structural variety of glyco-CMMs that can be conveniently accessed and its scope was
probed through the reaction of 1a (See Figure 8) with SBL-N62C, -S156C, -S166C,
-L217C (Figure 9).
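As a cross-check on the kinetic parameters reported in Table 3, the specificity constant kcat/KM and its uncertainty can be recomputed from kcat and KM. The sketch below assumes that the two uncertainties are independent and combine in quadrature as relative errors; that assumption is not stated in the text, although it reproduces the tabulated uncertainties closely. The function name is illustrative.

```python
from math import sqrt


def specificity_constant(kcat, kcat_err, km, km_err):
    """Return (kcat/KM, propagated uncertainty).

    Relative errors are combined in quadrature, which ASSUMES the uncertainties
    in kcat and KM are independent.
    """
    ratio = kcat / km
    relative = sqrt((kcat_err / kcat) ** 2 + (km_err / km) ** 2)
    return ratio, ratio * relative


# Wild-type SBL and unmodified N62C values as given in Table 3.
wt_ratio, wt_err = specificity_constant(153.0, 4.0, 0.73, 0.05)
n62c_ratio, n62c_err = specificity_constant(174.0, 9.0, 1.90, 0.20)
print(f"WT:   {wt_ratio:.1f} +/- {wt_err:.1f} s^-1 mM^-1")
print(f"N62C: {n62c_ratio:.1f} +/- {n62c_err:.1f} s^-1 mM^-1")
```

The results, roughly 210 ± 15 and 92 ± 11 s⁻¹ mM⁻¹, are close to the tabulated 209 ± 15 and 92 ± 11; small differences are expected if the table was computed from unrounded kcat and KM values.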

The extent of deacetylation during modification was highly site-dependent.
Modification of L217C with reagent 1a (See Figure 8) at pH 9.5 was accompanied by
complete in situ deacetylation, and the sole product was the fully deprotected
glucosylated-SBL, L217C-S-β-Glc. In contrast, treatment of N62C, S156C, and S166C
with 1a (See Figure 8) at pH 9.5 yielded only fully acetylated products, N62C-S-a (See
Figure 8), S156C-S-a (See Figure 8), and S166C-S-a (See Figure 8), respectively. To
examine the effects of pH upon deacetylation, the reaction of L217C with 1a (See Figure
8) was chosen. At pH 7.5 and 5.5, the products retained two and three acetate groups,
forming L217C-S-β-Glc(Ac)₂ and L217C-S-β-Glc(Ac)₃, respectively. In all cases,
complete integrity of the site selectivity was retained.

This valuable site-dependent deacetylation was attributed to a novel
intramolecular SBL-catalyzed process. Although acetate esters are moderately
chemically labile in aqueous solution at pH 9.5, they are not at either pH 7.5 or 5.5
(Greene et al., Protective Groups in Organic Synthesis, 2nd ed. New York, Wiley (1991),
which is hereby incorporated by reference). The striking differences in behavior during
modification between L217C and the three other mutants N62C, S156C, and S166C
under identical reaction conditions discounted the possibility of deacetylation prior to
modification. In addition, it was noted that position 217 bears an internally-oriented side
chain and that modification of surface exposed position 156 showed no sign of
deacetylation. This observation discounted the possibility of either in situ chemical
deacetylation or intermolecular enzymatic deacetylation. Furthermore, this ability of SBL
to intramolecularly deacetylate was confirmed by the reaction of L217C-S-β-Glc(Ac)₃ at
pH 9.5. Incubation of L217C-S-β-Glc(Ac)₃ under standard modification reaction
conditions, but without reagent 1a (See Figure 8), gave L217C-S-β-Glc as the sole
product (Figure 9).

The enormous potential of this method was demonstrated by the
preparation of a small library of differently acetylated glycosylated CMMs through the
reaction of SBL-N62C, -S156C, -S166C, and -L217C with MTS reagents 1g-k (See
Figure 8). Using the pH-activity profiles of wild-type ("WT") and CMMs of SBL as a
guide, pH 5.5 and 6.5 were chosen to minimize deacetylation. Typically, the specific
activity of SBL and its CMMs drops sharply below pH 7.5 to levels that at pH 5.5 are
5-20% of those at optimal pH (8.5-9.5) (DeSantis et al., "Chemical Modifications at a Single
Site Can Induce Significant Shifts in the pH Profiles of a Serine Protease," J. Am. Chem.
Soc., 120:8582-8586 (1998), which is hereby incorporated by reference). As expected,
this drop in hydrolytic activity was reflected in the products of these modifications with
1g-k (See Figure 8), which in all cases retained two or more acetate groups.

For example, at pH 5.5 reactions of SBL-L217C and -S166C created
singly deacetylated CMMs, with the exception of pure dideacetylated glyco-CMMs
L217C-S-Et-α-Glc(Ac)2 and S166C-S-Lac(Ac)5. The formation of the latter may reflect
the presence of two primary acetates in disaccharidic MTS reagent 1k (See Figure 8).
Primary acetate groups are typically more labile than secondary acetate groups under
conditions of intermolecular enzymatic deacetylation (Bashir et al., "Enzymatic
Esterification and De-Esterification of Carbohydrates - Synthesis of a Naturally-
Occurring Rhamnopyranoside of p-Hydroxybenzaldehyde and a Systematic Investigation
of Lipase-Catalyzed Acylation of Selected Arylpyranosides," J. Chem. Soc., Perkin
Trans. 1, 2203-2222 (1995), which is hereby incorporated by reference). In contrast,
reactions of SBL-S156C gave only the fully acetylated CMMs, S156C-S-g-k (See Figure
8). This uniform lack of deacetylation observed for surface exposed glycans at position
156 was consistent with an intramolecular enzyme-catalyzed mechanism requiring
internally-oriented acetate groups. Interestingly, the reactions of SBL-N62C were also
determined by the anomeric configuration of 1g-k (See Figure 8), with α-MTS reagents
1g,i (See Figure 8) giving products that retained all acetate groups while β-MTS reagents
1h,j,k (See Figure 8) gave monodeacetylated products. The range of accessible acetylated
glyco-CMMs was further extended through modification at pH 6.5. For example, at position 62,
it allowed the introduction of diacetylated β-glucose, forming N62C-S-Et-β-Glc(Ac)2, in
place of the triacetylated N62C-S-Et-β-Glc(Ac)3 formed at pH 5.5. The range of
acetylation at position 166 was similarly expanded through the formation of
S166C-S-Et-β-Glc(Ac)2 in place of S166C-S-Et-β-Glc(Ac)3.



Example 10 - Glycan Structure-Hydrolytic Activity Relationships 

The effects of glycosylation upon SBL were assessed by the determination
of kcat and KM for the hydrolysis of succinyl-AAPF-p-nitroanilide (Suc-AAPF-pNA) at
pH 8.6. The kinetic parameters of the 48 CMMs generated were compared with those of
WT and unmodified mutants in Table 3. The highly selective and controlled method
shown in Figure 8 allowed the introduction of structurally related monosaccharides,
D-glucose, D-galactose, and D-mannose, in addition to the more sterically bulky
disaccharide lactose. From the resulting glycosylated CMMs, a detailed and precise set of
structure-activity relationships was generated (Figures 10-12).

At position 62, in the S2 pocket, the 2.3-fold reduction in kcat/KM caused by
mutation to cysteine was partially restored by glycosylation (Figure 10A). The
introduction of ethyl-tethered α- or β-glucose, β-galactose or α-mannose to N62C
increased kcat/KM and formed N62C-S-b-e (See Figure 8) with kcat/KMs 1.5- to 2-fold lower
than WT. Despite its steric bulk and high hydrophilicity, disaccharidic lacto-CMM
N62C-S-f (See Figure 8) also showed higher activity than N62C, with kcat/KM only 1.7-fold
lower than WT.

The effects of the mutation of position 217 in the S1' pocket were
intrinsically more dramatic, as indicated by a value of kcat/KM for L217C that is 4-fold
lower than WT (Figure 10B). The introduction of deprotected untethered glucose,
forming L217C-S-β-Glc, lowered kcat/KM further to 6-fold lower than WT. In contrast,
glycosylation of position 217 with ethyl tethered MTS reagents 1b-f (See Figure 8)
restored activity, and kcat/KMs for L217C-S-b-f (See Figure 8) were similar to each other in
the range 2.5- to 3.1-fold lower than WT. This striking difference between tethered
L217C-S-b-f (See Figure 8) and untethered L217C-S-β-Glc illustrated that SBL tolerates
the replacement of hydrophobic Leu with highly hydrophilic carbohydrate moieties better
when they are linked by a hydrophobic ethyl spacer group than when they are directly
linked as Cys-S-β-Glc. This may indicate that a structural requirement for efficient
amidase activity is a closely-bound hydrophobic residue in the S1' subsite of SBL and
contrasts sharply with the excellent enhancement of esterase activity caused by the same
Cys-S-β-Glc substitution.

Mutation of position 156 in the S1 pocket to cysteine caused a 1.4-fold
drop in kcat/KM (Figure 10C). Subsequent introduction of deprotected S-Et-α-Glc, side
chain b (See Figure 8), resulted in a kcat/KM for S156C-S-b (See Figure 8) that was
2.3-fold lower than WT. From S156C-S-b to -f (See Figure 8), kcat/KMs varied in an arced
manner, peaking at 1.9-fold lower than WT for S156C-S-d (See Figure 8) and then
decreasing monotonically to a kcat/KM for S156C-S-f (See Figure 8) that was 3-fold lower
than WT. The similar KM values for these S156 CMMs to those of SBL-WT were
indicative of these modifications having little effect upon ground state binding and were
consistent with the surface exposed orientation of the S156 side chain.

At position 166, in the S1 pocket, the 2.5-fold decrease in kcat/KM caused by
mutation to cysteine was amplified by modification with 1b (See Figure 8) and led to a
kcat/KM value 3-fold lower than WT for S166C-S-b (See Figures 8 and 10D). From
S166C-S-b to -f (See Figure 8), kcat/KM decreased monotonically to a kcat/KM for
S166C-S-f (See Figure 8), in which the S1 binding site was occupied by the sterically bulky
disaccharide lactose, that was 3.8-fold lower than WT.

Example 11 - Kinetic Effects of Glycosylation with Acetylated Carbohydrates.

The enormous potential of the controlled site-selective glycosylation
approach depicted in Figure 8 was illustrated by the great variety of changes in kcat/KM that
were caused by the introduction of acetylated side chains a,g-k (See Figure 8) to SBL.
These dramatic changes contrast with the slight variations found for deprotected side
chains b-f (See Figure 8). For example, at position 62, an alternating decrease-increase
pattern was observed (Figure 11A). This resulted in a kcat/KM for tetraacetylated
α-gluco-CMM N62C-S-g (See Figure 8) that was 1.1-fold higher than WT. Similar alternating
patterns were also seen at positions 217 (Figure 11B) and 166 (Figure 11D). At position
156, variations were slight, which was consistent with its surface exposed orientation
(Figure 11C).

To examine the cause of these variations, the kcat/KMs of acetylated
glycosylated CMMs were compared with those for deprotected glycosylated CMMs with
the same glycan structure and stereochemistry (Figure 12). This separated the effects of
acetylation from the effects of glycosylation and allowed the underlying effects of
modification to be dissected. It was clear from Figure 12 that the anomeric
stereochemistry of the acetylated glycans modulates kcat/KM.

For example, at position 62 (Figure 12A) comparison of N62C-S-b,c (See
Figure 8) with N62C-S-g (See Figure 8) and N62C-S-Et-β-Glc(Ac)2,3 showed that
increasing the number of acetate groups from zero to four, from N62C-S-b (See Figure 8)
to N62C-S-g (See Figure 8), increased kcat/KM 1.6-fold for the α-gluco side-chain b (See
Figure 8). In contrast, increasing the number from zero to two or three, from N62C-S-c to
N62C-S-Et-β-Glc(Ac)2 or 3, was detrimental for the β-gluco side-chain c (See Figure 8) and
led to a 2-fold decrease. Similarly, N62C-S-Et-β-Gal(Ac)3 displayed a distinctly lower
kcat/KM than N62C-S-e (See Figure 8) that was 5-fold lower than WT. These changes in
kcat/KM upon acetylation were manifested largely through increased or decreased ground
state binding, of which the most striking example was a KM for N62C-S-Et-β-Gal(Ac)3
that was 2.4-fold higher than deprotected galacto-CMM N62C-S-e (See Figure 8).

At position 217, control of the level of acetylation through pH, as shown in
Figure 9, had allowed the introduction of untethered β-D-glucose bearing
zero, two, and three acetate groups. As Figure 12B illustrates, the addition of two or three
acetate groups restored kcat/KM from 6-fold lower than WT for L217C-S-β-Glc to 2-fold
lower than WT for L217C-S-β-Glc(Ac)2 or L217C-S-β-Glc(Ac)3. This showed that
acetylation allowed fine-tuning of activity and paralleled increases in the esterase kcat/KMs
of these CMMs.

The same trend in kcat/KM was observed for the L217C ethyl tethered
CMMs as at position 62: acetylation was beneficial to α-tethered but detrimental to
β-tethered CMMs. For example, increasing the number of acetate groups in the α-linked
glucose moiety from zero to two, i.e., from L217C-S-b (See Figure 8) to
L217C-S-Et-α-Glc(Ac)2, increased kcat/KM to 2-fold lower than WT. In contrast, increasing
the number of acetate groups in the epimeric β-linked moiety from zero to three, i.e., from
L217C-S-c (See Figure 8) to L217C-S-Et-β-Glc(Ac)3, halved kcat/KM to 6-fold lower than WT.
Similarly, the kcat/KM of α-linked L217C-S-Et-α-Man(Ac)3 was 1.5-fold higher than the
corresponding deprotected L217C-S-d (See Figure 8), while that of β-linked
L217C-S-Et-β-Gal(Ac)3 was 2-fold lower than the corresponding deprotected L217C-S-e
(See Figure 8).

Consistent with its surface exposed orientation, the changes at position 156
caused by acetylation were slight and no variation with anomeric stereochemistry was
seen (Figure 12C). Increasing the number of acetate groups from zero to four, from
S156C-S-b-e (See Figure 8) to S156C-S-g-j (See Figure 8), decreased kcat/KM by 1.05- to
1.6-fold. The most sterically bulky lactose side chain (-k) (See Figure 8) gave rise to the
lowest kcat/KM at position 156, 3.2-fold lower than WT, and indicated that even at the
surface of SBL the introduction of sterically bulky groups still allowed tailoring of
activity.

At position 166, the effects of increased acetylation were, as at positions
62 and 217, modulated by glycan anomeric configuration (Figure 12D). However, the
direction of these increases and decreases was reversed: acetylation was beneficial to
β-tethered but detrimental to α-tethered CMMs. For example, the α-tethered
S166C-S-Et-α-Glc(Ac)3 had a 1.8-fold lower kcat/KM value than the corresponding unprotected
S166C-S-b (See Figure 8), while β-linked CMMs S166C-S-Et-β-Glc(Ac)2,3 had 1.6-fold higher
kcat/KM values than the corresponding fully deprotected S166C-S-c (See Figure 8). Again,
these variations were largely manifested through changes in ground state binding. For
example, KM increased 1.4-fold from S166C-S-Et-α-Man (-d) (See Figure 8) to
S166C-S-Et-α-Man(Ac)3.

It should be noted that the changes in activity of the lacto-CMMs upon
acetylation fell largely outside of these trends, and at all four positions acetylation of the
bulky, disaccharidic side-chain f (See Figure 8) caused a general decrease in kcat/KMs. For
example, at position 62 heptaacetylation, from N62C-S-f (See Figure 8) to N62C-S-k
(See Figure 8), resulted in a lowering of kcat/KM to a value that is 3-fold lower than WT.
Despite the greater steric bulk of side chain k (See Figure 8), this drop was a consequence
of a lower kcat, 2-fold lower than N62C-S-f (See Figure 8), rather than a higher KM. In fact,
the KM value of N62C-S-k (See Figure 8) was 1.3-fold lower than N62C-S-f (See Figure
8). Similarly, L217C-S-Et-Lac(Ac)6 and S166C-S-Et-Lac(Ac)5 had 1.7-fold and 1.4-fold
lower kcat/KMs than the corresponding deprotected CMMs, respectively.

In summary, the strategy of site-directed mutagenesis combined with
chemical modification was exploited for the site-selective glycosylation of SBL. This
method was general and versatile, and allowed the preparation of pure glycoforms which
constitute the first examples of regio- and glycan-specific protein glycosylation at
predetermined sites. Careful control of a novel SBL-catalyzed intramolecular
deacetylation greatly expanded the scope of this method and, through reaction of
SBL-N62C, -S156C, -S166C, and -L217C with peracetylated MTS reagents 1a,g-k (See Figure
8), allowed the introduction of glycans with precisely modulated levels of acetylation.

The glycosylated CMMs formed display kcat/KM values that ranged from
1.1-fold higher than WT to 7-fold lower than WT. Without the use of this highly
selective glycosylation technique, the determination of such precise trends would be
unachievable and variations caused by previous non-specific glycosylation could only be
interpreted in a general manner. It has been demonstrated that subtle differences in
carbohydrate structure may be used to fine tune the activity of SBL. For example, the
anomeric stereochemistry of the glycans introduced modulated changes in kcat/KM upon
acetylation. At positions 62 and 217, acetylation enhanced the activity of α-tethered
CMMs but decreased that of β-tethered CMMs. This trend was reversed at position 166 where,
in contrast, acetylation enhanced the kcat/KMs of β-tethered CMMs but decreased those of
α-tethered CMMs. Consistent with its surface exposed nature, changes at position 156 were
more modest, but still allowed control of activity, particularly through glycosylation with
disaccharide lactose. These results illustrated the great potential for tailoring activity
through the correct choice of glycan and glycosylation site.

The ability of the glycosylation method to glycosylate the binding pockets
of SBL also creates opportunities to broaden its substrate specificity. For instance, an
array of hydrogen bonding hydroxyl groups may broaden its specificity towards hydrogen
bonding substrates such as glycosylated amino acids. Subtilisins have been elegantly
used to catalyze the synthesis of glycopeptides (Witte et al., "Solution- and Solid-Phase
Synthesis of N-protected Glycopeptide Esters of the Benzyl Type as Substrates for
Subtilisin-Catalyzed Glycopeptide Couplings," J. Am. Chem. Soc., 120:1979-1989
(1998); Wong et al., "Enzymatic-Synthesis of N-Linked and O-Linked Glycopeptides," J.
Am. Chem. Soc., 115:5893-5901 (1993), which are hereby incorporated by reference).
However, the natural specificity of these enzymes has limited these peptide ligations to
those in which the glycosylated residues are at least one residue distant (P2, P3... or
P2', P3'...) from the amide bond formed. For example, while ligation of Z-Gly-OBz with
H-Gly-Ser(Ac3GlcNAcβ)-NH2 was successful, no yield of product was obtained with
H-Ser(Ac3GlcNAcβ)-NH2 (Witte et al., J. Am. Chem. Soc., 120:1979-1989 (1998), which is
hereby incorporated by reference). The introduction of sugars to the S1 and S1' subsites as
hydrogen bonding groups demonstrated here may enhance the specificity of proteases
towards hydrophilic substrates.

Furthermore, by choosing carbohydrate attachments that differ from each
other at only one stereocenter, SARs may be determined by examining changes in activity
as the nature of the sugar side chain is varied. For example, the effect of inverting
stereocenters in the order C-4 → C-1 → C-2 can be determined using CMMs in the series
e → c → b → d (See Figure 8). While the current illustrations have been with SBL as a
protein example, the method is clearly amenable to the glycosylation of any protein and is
without limitation with respect to the sites and to the glycans that may be conjugated. It
will, therefore, allow the introduction of any therapeutically important carbohydrate
recognition determinant, of which the β-D-galactopyranosyl moiety of e and f (See Figure
8), which represents a ligand of the hepatic asialoglycoprotein receptor (Sharon et al., Essays
Biochem., 30:59-75 (1995), which is hereby incorporated by reference), is just one
example.

Example 12 - Esterase Screen 

Specificity constants determined using the low substrate approximation
were measured indirectly using Ellman's reagent (Ellman et al., Biochem. Pharmacol.,
7:88-95 (1961), which is hereby incorporated by reference) (ε412 = 13600 M^-1 cm^-1) using
0.15 and 0.30 mM succinyl-AAPF-SBn as substrate in 0.1 M Tris.HCl, containing 0.005
vol% DMSO, 1 vol% 37.5 mM Ellman's reagent in DMSO, pH 8.6.

Example 13 - Full Esterase Kinetics Measurements 

Michaelis-Menten constants were measured at 25 °C by curve fitting
(Grafit® 3.03, Erithacus Software Ltd., Staines, Middlesex, UK) of the initial rate data
determined at eight concentrations (31.25 µM - 3 mM) of the succinyl-AAPF-SBn
substrate, followed indirectly using Ellman's reagent in 0.1 M Tris.HCl, containing 0.005
vol% DMSO, 1 vol% 37.5 mM Ellman's reagent in DMSO, pH 8.6.
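
As an illustration of how Michaelis-Menten constants can be extracted from initial rate data, the following minimal Python sketch fits Vmax and KM by a Hanes-Woolf linear regression. This is a simple stand-in, not the nonlinear fit performed by the Grafit software named above. The function name, the eight concentrations, and the synthetic noise-free rates are assumptions made for illustration only; the KM of 0.54 mM is the WT value reported in Table 5.

```python
def fit_michaelis_menten(substrate_mM, rates):
    """Estimate (Vmax, KM) from initial rates by Hanes-Woolf regression.

    The Michaelis-Menten equation v = Vmax*[S]/(KM + [S]) rearranges to
    [S]/v = [S]/Vmax + KM/Vmax, which is linear in [S].
    """
    n = len(substrate_mM)
    if n < 2 or n != len(rates):
        raise ValueError("need at least two matched ([S], v) points")
    xs = list(substrate_mM)
    ys = [s / v for s, v in zip(substrate_mM, rates)]
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                    # equals 1/Vmax
    intercept = mean_y - slope * mean_x  # equals KM/Vmax
    vmax = 1.0 / slope
    km = intercept * vmax
    return vmax, km


# Hypothetical eight-point series spanning 31.25 uM to 3 mM (values assumed).
concentrations_mM = [0.03125, 0.0625, 0.125, 0.25, 0.5, 1.0, 2.0, 3.0]
TRUE_VMAX = 1.0   # arbitrary rate units, assumed
TRUE_KM = 0.54    # mM, WT value from Table 5
synthetic_rates = [TRUE_VMAX * s / (TRUE_KM + s) for s in concentrations_mM]

fitted_vmax, fitted_km = fit_michaelis_menten(concentrations_mM, synthetic_rates)
print(f"Vmax = {fitted_vmax:.4f}, KM = {fitted_km:.4f} mM")
```

With real, noisy data a weighted nonlinear least-squares fit is generally preferred, because the Hanes-Woolf transformation distorts the error structure of the measurements.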

Example 14 - Esterase Activity Screen 

The glyco-CMMs shown in Table 4 were prepared by reacting reagents
1a-k (See Figure 8) with the chosen cysteine mutants SBL-N62C, -S156C, -S166C and
-L217C, purified, and extensively characterized as described previously.

Table 4.

Esterase Screen Results for Glycosylated CMMs (a)

(kcat/KM values in s^-1 mM^-1)

-R                    N62C-R          L217C-R         S166C-R         S156C-R
                      kcat/KM   E/A   kcat/KM   E/A   kcat/KM   E/A   kcat/KM   E/A (b)

H                        4380    48      5540   109       350     4   [illegible]
-SβGlc                      -     -      6350   186         -     -         -     -
b                        5327    37      7773    90      2688    37      1984    22
c                        6625    62      9791   130      2732    40      2197    21
d                        4627    36      8651    77      2924    45      2648    24
e                        5729    48     11923   163      3241    56      2505    28
f                        5990    48      9080   137      2511    44      1692    24
-SβGlc(Ac)2                 -     -      8776    86         -     -         -     -
-SβGlc(Ac)3                 -     -     11642   115         -     -         -     -
-SβGlc(Ac)4 (-a)         2502    19         -     -      2537    50      1356    17
-SEtαGlc(Ac)2               -     -      7893    79         -     -         -     -
-SEtαGlc(Ac)3               -     -         -     -      1829    45         -     -
-SEtαGlc(Ac)4 (-g)       6851    30         -     -         -     -      1887    34
-SEtβGlc(Ac)2            1925    36         -     -      5058    46         -     -
-SEtβGlc(Ac)3            2736    51      4935   156      6033    54         -     -
-SEtβGlc(Ac)4 (-h)          -     -         -     -         -     -      2321    26
-SEtαMan(Ac)3               -     -     11751    71      1537    37         -     -
-SEtαMan(Ac)4 (-i)       5807    50         -     -         -     -      2629    31
-SEtβGal(Ac)3            2169    50      3889   116      3159    38         -     -
-SEtβGal(Ac)4 (-j)          -     -         -     -         -     -      2010    24
-SEtLac(Ac)5                -     -         -     -      1543    38         -     -
-SEtLac(Ac)6                -     -      5468   139         -     -         -     -
-SEtLac(Ac)7             2135    29         -     -         -     -   [illegible]

(a) Kinetic constants determined in duplicate using the low substrate concentration
approximation at pH 8.6, 0.005% Tween 80, 1% DMSO with suc-AAPF-SBn as substrate.
[S] = 15 or 30 µM, [E] = 4.8 x 10^[illegible] to [illegible] x 10^-10 M.

(b) E/A = (kcat/KM)esterase / (kcat/KM)amidase.

(c) For SBL-WT: kcat/KM = 3592.5 mM^-1 s^-1, E/A = 17.

The kinetic parameters of esterase activity were determined at pH 8.6 by indirectly
following the release of thiobenzyl alcohol from the substrate succinyl-Ala-Ala-Pro-Phe-
SBn (suc-AAPF-SBn) with Ellman's reagent (Ellman et al., Biochem. Pharmacol., 7:88-
95 (1961), which is hereby incorporated by reference). To allow a rapid screen of
esterase activity, a low substrate concentration ([S] << KM) was used that allowed kcat/KM
to be determined directly from the initial rate of reaction. The results from the screen are
shown in Table 4.
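
The low substrate approximation used in the screen can be sketched numerically. When [S] << KM, the Michaelis-Menten rate reduces to v = (kcat/KM)[E][S], so kcat/KM follows from a single initial rate. The minimal Python sketch below converts an absorbance slope at 412 nm to a rate using the extinction coefficient quoted for Ellman's reagent. The function name, the 1 cm path length, and the example enzyme concentration and slope are assumptions made for illustration, not values taken from the experiments.

```python
EPSILON_412 = 13600.0  # M^-1 cm^-1, extinction coefficient quoted for Ellman's reagent


def specificity_constant(dA_dt, enzyme_M, substrate_M, path_cm=1.0):
    """Return kcat/KM in mM^-1 s^-1 under the low substrate approximation.

    dA_dt is the initial slope of absorbance at 412 nm (per second).
    Assumes one chromophore is formed per substrate molecule hydrolyzed
    and a 1 cm path length unless stated otherwise (both are assumptions).
    """
    rate_M_per_s = dA_dt / (EPSILON_412 * path_cm)      # Beer-Lambert law
    k_per_M = rate_M_per_s / (enzyme_M * substrate_M)   # M^-1 s^-1
    return k_per_M / 1000.0                             # convert to mM^-1 s^-1


# Hypothetical inputs: a slope constructed so that the result equals the WT
# value of 3592.5 mM^-1 s^-1 at [E] = 1e-10 M (assumed) and [S] = 15 uM.
example_enzyme_M = 1e-10
example_substrate_M = 15e-6
example_slope = EPSILON_412 * 3.5925e6 * example_enzyme_M * example_substrate_M

wt_like = specificity_constant(example_slope, example_enzyme_M, example_substrate_M)
print(f"kcat/KM = {wt_like:.1f} mM^-1 s^-1")
```

The result is only as good as the assumption [S] << KM; for enzymes with a small KM the value obtained this way underestimates the true kcat/KM.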

Modification at position 62, in the S2 pocket, with deprotected sugar
reagents 1b-f (See Figure 8) increased kcat/KM, resulting in a series of five enzymes that
had similar kcat/KMs that were 1.3- to 1.9-fold greater than WT (Figure 13A). The
presence of an α-linkage was clearly deleterious to activity, as N62C-SEtβGlc (-c) (See
Figure 8) had a kcat/KM 1.2-fold greater than its epimer N62C-SEtαGlc (-b) (See Figure 8)
and 1.9-fold greater than WT. Furthermore, the α-linked N62C-SEtαMan (-d) (See
Figure 8) had the lowest kcat/KM in this group, which was 1.3-fold greater than WT.

As at position 62, the introduction of any of the deprotected sugar side
chains b-f (See Figure 8) at position 217, in the S1' pocket, increased kcat/KM (Figure 13B).
However, the effects of glycosylation at this site were far more dramatic, as demonstrated
by a kcat/KM for L217C-SEtβGal (-e) (See Figure 8) that was 3.4-fold greater than WT. By
comparing the kcat/KMs of L217C-SβGlc and L217C-SEtβGlc (-c) (See Figure 8), it was
possible to gauge the effect on activity of introducing an ethyl tether at this position. This
introduction increased kcat/KM from 1.8-fold greater than WT for L217C-SβGlc to
2.7-fold greater than WT for L217C-SEtβGlc (-c) (See Figure 8). As at position 62, β-linked
glyco-CMMs (-c,-e,-f) (See Figure 8) had higher kcat/KMs than the α-linked ones (-b,-d)
(See Figure 8). For example, L217C-SEtβGlc (-c) (See Figure 8) had a kcat/KM 1.3-fold
greater than L217C-SEtαGlc (-b) (See Figure 8).

Consistent with the surface exposed orientation of the S156 side chain, the
S156C deprotected glyco-CMMs had similar kcat/KMs that were 1.3- to 2.1-fold lower than
WT (Figure 13C).

At position 166, in the S1 pocket, mutation to cysteine resulted in an
enzyme with a severely lowered kcat/KM that was 10-fold lower than WT. However,
subsequent modification with 1b-f (See Figure 8) restored much of the catalytic activity
(Figure 13D), and the S166C deprotected glyco-CMMs S166C-S-b-f (See Figure 8) had
similar kcat/KMs that varied from 1.1- to 1.4-fold lower than WT.

Example 15 - Kinetic Effects of Glycosylation with Acetylated Carbohydrates.

At position 62, in the S2 pocket, in sharp contrast to the trend observed for
the unacetylated N62C CMMs, acetylated N62C CMMs had a wide range of kcat/KMs.
Introduction of acetates increased the kcat/KMs of the α-linked CMMs relative to the
corresponding deprotected glyco-CMMs (Figure 14A). Thus, N62C-SEtαGlc(Ac)4 (-g)
(See Figure 8) and N62C-SEtαMan(Ac)4 (-i) (See Figure 8) had kcat/KMs 1.9- and 1.6-fold
greater than WT, respectively. Acetylation was clearly deleterious for β-linked CMMs, as
N62C-SEtβGal(Ac)3, N62C-SEtβGlc(Ac)3, and N62C-SEtβGlc(Ac)2 all had kcat/KMs
lower than WT. However, increasing the number of acetates present on the CMM
restored activity: N62C-SEtβGlc(Ac)3 had a kcat/KM only 1.3-fold lower than WT and
1.5-fold higher than N62C-SEtβGlc(Ac)2. In spite of their size, the sterically bulky
lactosylated N62C CMMs, N62C-SEtLac (-f) (See Figure 8) and N62C-SEtLac(Ac)7 (-k)
(See Figure 8), had kcat/KMs that were similar to those of the CMMs derived from
monosaccharides. This provided a clear example of the versatility of the glycosylation
method illustrated in Figure 8 and demonstrated that by using this method it was possible
to introduce very large structures into the active site of SBL while maintaining catalytic
competency.

Modification with acetylated reagents 1a,g-k (See Figure 8) at position
217, in the S1' pocket, as with 1b-f (See Figure 8), led to CMMs with greater than WT
kcat/KMs (Figure 14B). For the untethered glyco-CMMs, increasing the number of acetates
dramatically increased kcat/KM from 1.8-fold greater than WT for L217C-SβGlc to
2.4-fold greater than WT for L217C-SβGlc(Ac)2 and to 3.2-fold greater than WT for
L217C-SβGlc(Ac)3, and mirrored the trend seen in amidase kinetics. For the ethyl linked L217C
glyco-CMMs, the effect of acetylation was dependent on the anomeric stereochemistry, as
observed for N62C glyco-CMMs and the L217C glyco-CMMs amidase kinetics.
Acetylation of α-linked CMMs increased kcat/KM but decreased kcat/KM for β-linked
CMMs. This was most pronounced for L217C-SEtβGal(Ac)3, which had a kcat/KM only
1.1-fold greater than WT and 3.1-fold lower than L217C-SEtβGal (-e). In contrast to the
effect on deprotected L217C glyco-CMMs, the activity of acetylated L217C glyco-CMMs
decreased upon the introduction of the ethyl linker. For example, L217C-SβGlc(Ac)3 had
a kcat/KM 3.2-fold greater than WT, as compared with the 1.4-fold greater than WT kcat/KM
of L217C-SEtβGlc(Ac)3.

At position 156, in the S1 pocket, the S156C acetylated glyco-CMMs
displayed little difference in their kinetic constants from their unacetylated counterparts
(Figure 14C), an observation that was consistent with the surface exposed nature of the
position 156 side chain. Introducing the ethyl linker led to an increase in kcat/KM, from
2.6-fold lower than WT for S156C-SβGlc(Ac)4 (-a) (See Figure 8) to 1.5-fold lower than WT
for S156C-SEtβGlc(Ac)4 (-h) (See Figure 8).

In general, at position 166, in the S1 pocket, the effect of acetylation on
S166C ethyl linked glyco-CMMs was to reduce kcat/KMs relative to their deprotected
counterparts (Figure 14D). The exceptions were the ethyl linked β-gluco-CMMs,
S166C-SEtβGlc(Ac)2 and S166C-SEtβGlc(Ac)3, which displayed kcat/KMs 1.4- and 1.7-fold
greater than WT, respectively. These were the only two glyco-CMMs prepared at this site
to show an enhancement in kcat/KM relative to WT, and this example illustrates that the
correct selection of sugar is crucial to the tailoring of enzyme activity. In contrast to the
effects observed at positions 62 and 217, an α-linkage to the sugar moiety was deleterious
to the activity of the acetylated CMMs, and S166C-SEtαGlc(Ac)3 had a kcat/KM 1.9-fold
lower than WT, in direct contrast to S166C-SEtβGlc(Ac)3. Introduction of the sterically
bulky lactose moiety, in both acetylated and unacetylated forms, led to CMMs
S166C-SEtLac (-f) (See Figure 8) and S166C-SEtLac(Ac)5 with low kcat/KMs that were 1.4- and
2.3-fold lower than WT, respectively.

Example 16 - Full Esterase Kinetics. 

The three esterases with the highest kcat/KMs determined by the above
screen, L217C-SβGlc(Ac)3, L217C-SEtαMan(Ac)3, and L217C-SEtβGal (-e) (See Figure
8), had their individual kcats and KMs determined by the initial rates method. The results
are shown in Table 5.

Table 5.

Full Esterase Kinetic Data for Glycosylated CMMs (a)

Enzyme                kcat (s^-1)       KM (mM)        kcat/KM (mM^-1 s^-1)   x WT   E/A (b)

WT                    1940.0 ± 180      0.54 ± 0.07     3592.5 ± 572.7        1       17
L217C-SβGlc(Ac)3      4427.5 ± 100.9    0.15 ± 0.01    29516.7 ± 2079.0       8.4    293
L217C-SEtαMan(Ac)3    3827.0 ± 59.5     0.30 ± 0.01    12756.7 ± 469.2        3.6     77
L217C-SEtβGal         4398.5 ± 189.8    0.36 ± 0.04    12218.1 ± 1456.3       3.5    167

(a) Kinetic constants determined by method of initial rates in 0.1 M Tris buffer, pH 8.6,
0.005% Tween 80, 1% DMSO with suc-AAPF-SBn as substrate. [S] = 30 µM to 2 mM,
[E] = 9.6 x 10^-11 to 1.1 x 10^-10 M.

(b) E/A = (kcat/KM)esterase / (kcat/KM)amidase.

The results were in good agreement with those determined by the screen
for L217C-SEtαMan(Ac)3 and L217C-SEtβGal (-e) (See Figure 8) and confirmed the
activity of these two enzymes to be 3.6- and 3.5-fold higher than WT, respectively.
These increases in activity arose from both increased transition state stabilization, with
kcats 2- and 2.3-fold greater than WT, respectively, and from greater substrate binding,
with KMs 1.8- and 1.5-fold lower than WT, respectively.

Remarkable results were obtained for L217C-SβGlc(Ac)3. This enzyme
had a kcat 2.3-fold greater than WT and a KM 3.6-fold lower than WT, giving a kcat/KM
8.4-fold greater than WT and some 2.5-fold greater than the value estimated by the screen.
The difference in parameters obtained from the screen and the full kinetic analysis
exposed a limitation of the low substrate screen. For the low substrate approximation to
be accurate, the substrate concentration must be small compared to KM. The KM of
L217C-SβGlc(Ac)3 (0.15 mM) was evidently so small that the approximation did not hold
in this case. This was the largest enhancement of activity relative to WT achieved using
the combined site-directed mutagenesis and chemical modification strategy.
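
The size of the error introduced by the low substrate approximation can be estimated from the Michaelis-Menten equation. The apparent specificity constant obtained from a single rate is v/([E][S]) = kcat/(KM + [S]), so the true kcat/KM is underestimated by a factor of 1 + [S]/KM. The minimal Python sketch below evaluates this factor for the KM values in Table 5, taking the substrate concentrations stated in Example 12 (0.15 and 0.30 mM) as an assumption; the function name is hypothetical.

```python
def underestimation_factor(substrate_mM, km_mM):
    """Factor by which the low substrate screen underestimates kcat/KM.

    From v = kcat*[E]*[S]/(KM + [S]), the screen reports kcat/(KM + [S]),
    so true/apparent = (KM + [S])/KM = 1 + [S]/KM.
    """
    return 1.0 + substrate_mM / km_mM


KM_WT = 0.54      # mM, Table 5
KM_L217C = 0.15   # mM, L217C-SβGlc(Ac)3, Table 5

# Substrate concentrations assumed from Example 12.
for s_mM in (0.15, 0.30):
    print(
        f"[S] = {s_mM:.2f} mM: "
        f"WT factor = {underestimation_factor(s_mM, KM_WT):.2f}, "
        f"L217C-SβGlc(Ac)3 factor = {underestimation_factor(s_mM, KM_L217C):.2f}"
    )
```

Under these assumed concentrations the factor for the low-KM enzyme is between 2 and 3, which is of the same order as the 2.5-fold discrepancy noted above, while the factor for WT stays well below 2.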

Example 17 - Esterase Activity versus Amidase Activity. 

The differing effects of glycosylation upon amidase and esterase kcat/KMs
can be compared in an informative manner using the (kcat/KM)esterase / (kcat/KM)amidase ratio,
E/A (see Figure 15).
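
The E/A comparison is a ratio of two specificity constants, usually quoted as a fold change relative to WT. The minimal Python sketch below shows the arithmetic using the WT and L217C-SβGlc(Ac)3 values from Table 5; the function names are hypothetical, and the amidase constants are back-calculated from the tabulated esterase kcat/KM and E/A values rather than taken from the amidase data directly.

```python
def ea_ratio(esterase_kcat_km, amidase_kcat_km):
    """E/A = (kcat/KM)esterase / (kcat/KM)amidase."""
    return esterase_kcat_km / amidase_kcat_km


def fold_vs_wt(value, wt_value):
    """Fold change of a quantity relative to wild type."""
    return value / wt_value


# Esterase kcat/KM (mM^-1 s^-1) and E/A values from Table 5.
WT_ESTERASE, WT_EA = 3592.5, 17.0
MUTANT_ESTERASE, MUTANT_EA = 29516.7, 293.0

# Back-calculated amidase constants (an illustration, not reported data).
wt_amidase = WT_ESTERASE / WT_EA
mutant_amidase = MUTANT_ESTERASE / MUTANT_EA

ea_fold = fold_vs_wt(ea_ratio(MUTANT_ESTERASE, mutant_amidase), WT_EA)
print(f"E/A is {ea_fold:.1f}-fold greater than WT")
```

Because E/A combines an esterase gain with an amidase loss, a CMM can show a large E/A enhancement even when its esterase kcat/KM alone changes only moderately.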




All N62C deprotected glyco-CMMs had E/As that were enhanced relative
to WT. The increase in E/A was dependent on the presence of either an α- or a β-linkage,
with the β-linked CMMs N62C-SEtβGlc (-c) (See Figure 8), -SEtβGal (-e) (See Figure
8), -SEtLac (-f) (See Figure 8) having higher E/As than the α-linked CMMs
N62C-SEtαGlc (-b) (See Figure 8) and -SEtαMan (-d) (See Figure 8) (Figure 15A). These
increased ratios were due to both increases in esterase kcat/KM and reductions in amidase
kcat/KM.
As observed for the modifications made at position 62, glycosylation at
position 217, in the S1' pocket, led to enzymes that had greatly increased E/A ratios
relative to WT. Mutation to cysteine at position 217 increased E/A to 6.4-fold greater
than WT. Modification with unacetylated β-linked sugars -SβGlc, -SEtβGlc (-c) (See
Figure 8), -SEtβGal (-e) (See Figure 8), -SEtLac (-f) (See Figure 8) further increased E/A
(Figure 15B). In contrast, α-linked glyco-CMMs had lower E/A values than that of the
mutant. These changes in E/As were a result of parallel changes in esterase kcat/KMs that
were further amplified by opposing changes in amidase kcat/KMs. Introduction of an ethyl
linker reduced the E/A from 10.9-fold greater than WT for L217C-SβGlc to 7.6-fold
greater than WT for L217C-SEtβGlc (-c) (See Figure 8).

At position 156, in the S1 pocket, E/A ratios for unacetylated glyco-CMMs
were all similar to each other in the range 1.2- to 1.6-fold greater than WT (Figure 15C).

The mutant S166C had an exceptionally low E/A that was 4.2-fold lower
than WT (Figure 15D) and this was largely a result of its very low esterase k_cat/K_M.
Because modification of S166C restored esterase k_cat/K_M to levels approaching that of
WT and because S166C glyco-CMMs were poor amidases relative to WT, the net result
was a family of CMMs with similar E/A ratios that were all enhanced relative to WT and
significantly higher than the cysteine mutant (Figure 15D).

Example 18 - Effects of Glycosylation with Acetylated Carbohydrates on E/A.

At position 62, in the S2 pocket, with the exception of N62C-SEtβGal(Ac)3
and N62C-SEtαMan(Ac)4 (-i) (See Figure 8), acetylation of N62C glyco-CMMs led to a
reduction in E/A. However, like their deprotected counterparts, the acetylated N62C
glyco-CMMs all had larger E/As than WT (Figure 16A). Increasing the level of
acetylation increased the E/A ratio from 2.1-fold greater than WT for N62C-SEtβGlc(Ac)2
to 3.0-fold greater than WT for N62C-SEtβGlc(Ac)3. In spite of the
general increases in E/A observed for the ethyl linked glyco-CMMs, the untethered CMM
N62C-SβGlc(Ac)4 (-a) (See Figure 8) had an E/A very similar to WT.

At position 217, in the S1' pocket, L217C acetylated glyco-CMMs all had
enhanced E/As relative to WT (Figure 16B). As for their deprotected counterparts, β-linked
acetylated glyco-CMMs had higher E/As than that of the L217C mutant, whereas
the E/As of the α-linked acetylated glyco-CMMs were lower than that of the mutant. In
fact, modification at this site produced CMMs with by far the greatest enhancement of
this ratio and the E/A for L217C-SβGlc(Ac)3 was 17.2-fold greater than WT. In contrast
to the L217C deprotected glyco-CMMs, introducing an ethyl linker lowered E/A from
17.2-fold greater than WT for L217C-SβGlc(Ac)3 to 9.2-fold greater than WT for
L217C-SEtβGlc(Ac)3. For the untethered L217C acetylated glyco-CMMs, increasing the number
of acetates also increased E/A, from 5.0-fold greater than WT for L217C-SEtβGlc(Ac)2 to
17.2-fold greater than WT for L217C-SEtβGlc(Ac)3.

At position 156, in the S1 pocket, acetylation of α-linked glyco-CMMs
increased E/As (Figure 16C). Hence, S156C-SEtβGlc(Ac)4 (-h) (See Figure 8) had an
E/A that was 1.5-fold greater than WT whereas S156C-SEtαGlc(Ac)4 (-g) (See Figure 8)
had an E/A 2.0-fold greater than WT. S156C-SEtαMan(Ac)4 (-i) (See Figure 8) also had
an E/A 1.3-fold greater than that of its unacetylated counterpart, S156C-SEtαMan (-d)
(See Figure 8).

At position 166, in the S1 pocket, acetylation caused an increase in E/A for
the glucosylated CMMs, irrespective of the anomeric stereochemistry (Figure 16D).
Acetylation of all other sugar moieties led to a reduction in E/A. Increasing the number
of acetates increased E/A from 2.7-fold greater than WT for S166C-SEtβGlc(Ac)2 to 3.2-fold
greater than WT for S166C-SEtβGlc(Ac)3.

Example 19 - Molecular Modeling 

The X-ray structure of subtilisin Bacillus lentus with the peptide inhibitor
AAPF bound (Brookhaven database entry 1JEA) was used as the starting point for
calculations on wild type and CMMs. The enzyme setup was performed with Insight II,
version 2.3.0 (Biosym Technologies, Inc., San Diego, CA). To create initial coordinates
for the minimization, hydrogens were added at the pH 8.6 used for kinetic measurements.
This protonated all Lys and Arg residues and the N-terminus and deprotonated all Glu and
Asp residues and the C-terminal carboxyl group. The protonated form of His 64 was used
in all calculations. The model system was solvated with a 5 Å layer of water molecules.
The total number of water molecules in the system was 1143. The overall charge of the
enzyme-inhibitor complex resulting from this setup was +4 for the WT enzyme. Energy
simulations were performed with the DISCOVER program, Version 2.9.5 (Biosym
Technologies, Inc., San Diego, CA) on a Silicon Graphics Indigo computer, using the
consistent valence force field (CVFF) function. A non-bonded cutoff distance of 18 Å
with a switching distance of 2 Å was employed. The non-bonded pair list was updated
every 20 cycles and a dielectric constant of 1 was used in all calculations. The WT 
enzyme was minimized in stages, with initially only the water molecules being allowed to 
move, followed by water molecules and the amino acid side chains, and then finally the 
entire enzyme. The mutated and chemically modified enzymes were generated by 
modifying the relevant amino acid using the Builder module of Insight. These structures 
were then minimized in a similar manner. Initially, the side-chain of the mutated residue 
and the water molecules were minimized. Then, all side-chains and the water molecules 
were minimized while the backbones of the residues were constrained, then all of the 
atoms were minimized. The AAPF inhibitor was free to move throughout all stages of 
the minimization. Each stage of energy minimization was conducted by means of the 
method of steepest descents without Morse or cross terms until the derivative of energy 
with respect to structural perturbation was less than 5.0 kcal/Å; then, the method of
conjugate gradients, without Morse or cross terms until the derivative of energy with
respect to structural perturbation was less than 1.0 kcal/Å; and, finally, the method of
conjugate gradients, with Morse and cross terms until the final derivative of energy with
respect to structural perturbation was less than 0.1 kcal/Å.
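
The three-step convergence schedule described above (steepest descents to a derivative below 5.0, then conjugate gradients to below 1.0 and finally to below 0.1) can be sketched on a toy problem. The following Python sketch is only an illustration under stated assumptions: the quadratic "energy", its stiffness values, the use of the gradient norm as the convergence measure, and all function names are hypothetical, and nothing here reproduces DISCOVER, CVFF, or the Morse and cross terms.

```python
import math

# Toy diagonal quadratic "energy" E(x) = 0.5 * sum(k_i * x_i**2).
# Assumption: a stand-in for a force field, NOT CVFF; stiffness values are arbitrary.
STIFFNESS = [1.0, 10.0, 100.0]


def gradient(x):
    """Gradient of the toy energy; linear in x, so gradient(d) equals A*d."""
    return [k * xi for k, xi in zip(STIFFNESS, x)]


def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))


def norm(v):
    return math.sqrt(dot(v, v))


def line_step(x, g, d):
    """Take the exact line-search step along direction d (valid for a quadratic)."""
    alpha = -dot(g, d) / dot(d, gradient(d))
    return [xi + alpha * di for xi, di in zip(x, d)]


def steepest_descent(x, threshold, max_iter=10000):
    """Step along the negative gradient until the gradient norm is below threshold."""
    for _ in range(max_iter):
        g = gradient(x)
        if norm(g) < threshold:
            break
        x = line_step(x, g, [-gi for gi in g])
    return x


def conjugate_gradient(x, threshold, max_iter=10000):
    """Fletcher-Reeves conjugate gradients until the gradient norm is below threshold."""
    g = gradient(x)
    d = [-gi for gi in g]
    for _ in range(max_iter):
        if norm(g) < threshold:
            break
        x = line_step(x, g, d)
        g_new = gradient(x)
        beta = dot(g_new, g_new) / dot(g, g)
        d = [-gn + beta * di for gn, di in zip(g_new, d)]
        g = g_new
    return x


def staged_minimization(x):
    """Apply the 5.0 / 1.0 / 0.1 threshold schedule in sequence."""
    x = steepest_descent(x, 5.0)      # coarse relaxation
    x = conjugate_gradient(x, 1.0)    # intermediate refinement
    x = conjugate_gradient(x, 0.1)    # final refinement
    return x


if __name__ == "__main__":
    final = staged_minimization([5.0, 5.0, 5.0])
    print("final gradient norm:", norm(gradient(final)))
```

Starting with the robust but slow steepest-descent method and switching to conjugate gradients once the structure is near a minimum is a common design choice, since conjugate gradients converge faster close to the minimum but can behave poorly on a strained starting structure.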

The molecular basis for the vastly improved esterase activities observed
was analyzed by molecular modeling of the peptidyl product inhibitor AAPF bound to the
SBL-CMM L217C-SβGlc(Ac)3. While the substrate employed for kinetic analysis is
succinylated, this moiety was not included in molecular modeling since its orientation
was not reported in the X-ray structure of SBL, suggesting high mobility in the crystal.

As shown in Figures 17 and 18, molecular modeling analysis of
L217C-SβGlc(Ac)3 revealed that the observed k_cat/K_M changes correlate with the occupation of the
S1' pocket of SBL by the glucosylated -SβGlc(Ac)3 side chain. In the minimized structure
(Figure 17), the position of the glucose moiety was fixed by a network of hydrogen-bonding
interactions (shown as white dotted lines) between the oxygen atoms on the C-3,
4, and 6 substituents of glucose and water molecules in the surrounding external solvent.
This extensive solvation directed the C-2 substituent of glucose internally towards the
catalytic triad. In this orientation, the carbonyl oxygen atom of the C-2 acetate group acts
as a hydrogen bond (1.89 Å) acceptor and stabilizes water molecule 127 in close proximity
to the carboxy terminus of AAPF, used here as a substrate analog for modeling. This water
molecule is shown in Figure 17 hydrogen-bonding to the O atom of the carboxylic acid.

These results suggested that, firstly, the low amidase activity of
L217C-SβGlc(Ac)3, which was 2-fold lower than WT, was a result of the S1' pocket being occupied
by the glucose moiety at position 217. This prevents efficient binding of the pNA leaving
group and, therefore, decreases the rate of acyl-enzyme intermediate formation, which is
the rate-determining step for amidase activity. Secondly, after the pNA has been
displaced to form the covalent acyl-Ser221 intermediate, the glucose moiety stabilizes a
crucial, nucleophilic water molecule (Wat 127) in close proximity to the carbonyl carbon
atom, through a hydrogen-bond to the oxygen of the C-2 acetate of glucose, as illustrated
in Figure 18. This facilitates hydrolysis of the acyl-enzyme intermediate and therefore
increases the rate of deacylation, which is the rate-limiting step for esterase activity
(Zerner et al., J. Am. Chem. Soc., 86:3674-3679 (1964); Whitaker et al., J. Am. Chem.
Soc., 87:2728-2737 (1965); Berezin et al., FEBS Lett., 15:121-124 (1971), which are
hereby incorporated by reference).

Glycosylation of SBL at sites within the active site can dramatically
enhance its esterase activity. The library of glycosylated CMMs synthesized using the
combined site-directed mutagenesis and chemical modification strategy contains 22
CMMs with greater than WT activity. Glycosylation at positions 62, in the S2 pocket, and
217, in the S1' pocket, gave the greatest increases in k_cat/K_M. The most active CMM,
L217C-SβGlc(Ac)3, had a k_cat/K_M that was 8.4-fold greater than WT and was the most
active esterase synthesized using this approach. When surface-exposed position 156 was
glycosylated, there was little alteration in activity, and this demonstrates that the
introduction of sugars at such sites has little effect on the catalytic activity of SBL.

In addition to the tailoring of esterase k_cat/K_M values, glycosylation also led
to enormous improvements in specificity for ester versus amide hydrolysis, as determined
by measurement of the ratio (k_cat/K_M)esterase / (k_cat/K_M)amidase, E/A. This ratio has been
increased to 17.2-fold greater than WT for L217C-SβGlc(Ac)3. Such enzymes are very
attractive candidates for use in peptide synthesis, where a high esterase to amidase ratio is
desirable. Furthermore, the CMMs described above have an even greater potential for
this purpose, as the increases in E/As have been achieved by an increase in the catalytic
efficiency of these enzymes towards ester substrates, in addition to a reduction in the
amidase activity.

Although the invention has been described in detail for the purpose of 
illustration, it is understood that such detail is solely for that purpose, and variations can 
be made therein by those skilled in the art without departing from the spirit and scope of 
the invention which is defined by the following claims. 
WHAT IS CLAIMED:

1. A chemically modified mutant protein, said mutant protein
comprising a cysteine residue substituted for a residue other than cysteine in a precursor
protein, the substituted cysteine residue being subsequently modified by reacting said
cysteine residue with a glycosylated thiosulfonate.

2. A chemically modified mutant protein according to claim 1, 
wherein the protein is an enzyme. 

3. A chemically modified mutant protein according to claim 2,
wherein the enzyme is a protease.

4. A chemically modified mutant protein according to claim 3, 
wherein the protease is a Bacillus lentus subtilisin. 

5. A chemically modified mutant protein according to claim 1,
wherein said thiosulfonate comprises an alkylthiosulfonate. 

6. A chemically modified mutant protein according to claim 5, 
wherein said alkylthiosulfonate comprises methanethiosulfonate. 

7. A chemically modified mutant protein according to claim 1, 
wherein the residue other than cysteine is an amino acid selected from the group 
consisting of asparagine, leucine, and serine. 

8. A chemically modified mutant protein according to claim 1,
wherein the residue other than cysteine is in a substrate binding subsite of the protein.

9. A chemically modified mutant protein according to claim 1,
wherein the glycosylated thiosulfonate comprises a thiol side chain comprising -S-β-Glc,
-S-Et-β-Gal, -S-Et-β-Glc, -S-Et-α-Glc, -S-Et-α-Man, -S-Et-Lac, -S-β-Glc(Ac)2,
-S-β-Glc(Ac)3, -S-β-Glc(Ac)4, -S-Et-α-Glc(Ac)2, -S-Et-α-Glc(Ac)3, -S-Et-α-Glc(Ac)4,
-S-Et-β-Glc(Ac)2, -S-Et-β-Glc(Ac)3, -S-Et-β-Glc(Ac)4, -S-Et-α-Man(Ac)3,
-S-Et-α-Man(Ac)4, -S-Et-β-Gal(Ac)3, -S-Et-β-Gal(Ac)4, -S-Et-Lac(Ac)5, -S-Et-Lac(Ac)6,
or -S-Et-Lac(Ac)7.

10. A chemically modified mutant protein according to claim 1,
wherein the carbohydrate moiety is a dendrimer moiety.

11. A method of producing a chemically modified mutant protein
comprising the steps of: (a) providing a precursor protein; (b) substituting an amino acid
residue other than cysteine in said precursor protein with a cysteine; (c) reacting said
substituted cysteine with a glycosylated thiosulfonate, said glycosylated thiosulfonate
comprising a carbohydrate moiety; and (d) obtaining a modified glycosylated protein
wherein said substituted cysteine comprises a carbohydrate moiety attached thereto.

12. A method according to claim 11, wherein said thiosulfonate
comprises an alkylthiosulfonate.

13. A method according to claim 12, wherein said alkylthiosulfonate
comprises a methanethiosulfonate.

14. A method according to claim 11, wherein the protein is an enzyme.

15. A method according to claim 14, wherein the enzyme is a protease.

16. A method according to claim 15, wherein the protease is a Bacillus
lentus subtilisin.

17. A method according to claim 11, wherein the amino acid residue
other than cysteine is an amino acid selected from the group consisting of asparagine,
leucine, and serine.

18. A method according to claim 11, wherein the amino acid residue
other than cysteine is in a substrate binding subsite of the protein.

19. A method according to claim 11, wherein the glycosylated
thiosulfonate comprises a thiol side chain comprising -S-β-Glc, -S-Et-β-Gal, -S-Et-β-Glc,
-S-Et-α-Glc, -S-Et-α-Man, -S-Et-Lac, -S-β-Glc(Ac)2, -S-β-Glc(Ac)3, -S-β-Glc(Ac)4,
-S-Et-α-Glc(Ac)2, -S-Et-α-Glc(Ac)3, -S-Et-α-Glc(Ac)4, -S-Et-β-Glc(Ac)2,
-S-Et-β-Glc(Ac)3, -S-Et-β-Glc(Ac)4, -S-Et-α-Man(Ac)3, -S-Et-α-Man(Ac)4,
-S-Et-β-Gal(Ac)3, -S-Et-β-Gal(Ac)4, -S-Et-Lac(Ac)5, -S-Et-Lac(Ac)6, or -S-Et-Lac(Ac)7.

20. A method according to claim 11, wherein the carbohydrate moiety
is a dendrimer moiety.

21. A glycosylated thiosulfonate comprising:

      O
      ||
H3C - S - SR
      ||
      O

wherein R comprises -β-Glc, -Et-β-Gal, -Et-β-Glc, -Et-α-Glc, -Et-α-Man, -Et-Lac,
-β-Glc(Ac)2, -β-Glc(Ac)3, -β-Glc(Ac)4, -Et-α-Glc(Ac)2, -Et-α-Glc(Ac)3, -Et-α-Glc(Ac)4,
-Et-β-Glc(Ac)2, -Et-β-Glc(Ac)3, -Et-β-Glc(Ac)4, -Et-α-Man(Ac)3, -Et-α-Man(Ac)4,
-Et-β-Gal(Ac)3, -Et-β-Gal(Ac)4, -Et-Lac(Ac)5, -Et-Lac(Ac)6, or -Et-Lac(Ac)7.

22. A method of modifying the functional characteristics of a protein
comprising:

providing a protein and

reacting the protein with a glycosylated thiosulfonate reagent under
conditions effective to produce a glycoprotein with altered functional characteristics as
compared to the protein.

23. A method according to claim 22, wherein the protein is an enzyme.

24. A method according to claim 23, wherein the enzyme is a protease.

25. A method according to claim 24, wherein the protease is a Bacillus 
lentus subtilisin. 

26. A method according to claim 22, wherein the glycosylated
thiosulfonate comprises:

      O
      ||
H3C - S - SR
      ||
      O

wherein R comprises -β-Glc, -Et-β-Gal, -Et-β-Glc, -Et-α-Glc, -Et-α-Man, -Et-Lac,
-β-Glc(Ac)2, -β-Glc(Ac)3, -β-Glc(Ac)4, -Et-α-Glc(Ac)2, -Et-α-Glc(Ac)3, -Et-α-Glc(Ac)4,
-Et-β-Glc(Ac)2, -Et-β-Glc(Ac)3, -Et-β-Glc(Ac)4, -Et-α-Man(Ac)3, -Et-α-Man(Ac)4,
-Et-β-Gal(Ac)3, -Et-β-Gal(Ac)4, -Et-Lac(Ac)5, -Et-Lac(Ac)6, or -Et-Lac(Ac)7.

27. A method of determining the structure-function relationships of
chemically modified mutant proteins comprising:

providing first and second chemically modified mutant proteins according
to claim 1, wherein the glycosylation pattern of the second chemically modified mutant
protein is different from the glycosylation pattern of the first chemically modified mutant
protein;

evaluating a functional characteristic of the first and second chemically
modified mutant proteins; and

correlating the functional characteristic of the first and second chemically
modified mutant proteins with the structures of the first and second chemically modified
mutant proteins.

28. A method according to claim 27, wherein the protein is an enzyme.

29. A method according to claim 28, wherein the enzyme is a protease.

30. A method according to claim 29, wherein the protease is a Bacillus
lentus subtilisin.

31. A method according to claim 27, wherein the residue other than 
cysteine is an amino acid selected from the group consisting of asparagine, leucine, and 
serine. 

32. A method according to claim 27, wherein the residue other than 
cysteine is in a substrate binding subsite of the protein. 



33. A method according to claim 27, wherein the glycosylated
thiosulfonate comprises a thiol side chain comprising -S-β-Glc, -S-Et-β-Gal, -S-Et-β-Glc,
-S-Et-α-Glc, -S-Et-α-Man, -S-Et-Lac, -S-β-Glc(Ac)2, -S-β-Glc(Ac)3, -S-β-Glc(Ac)4,
-S-Et-α-Glc(Ac)2, -S-Et-α-Glc(Ac)3, -S-Et-α-Glc(Ac)4, -S-Et-β-Glc(Ac)2,
-S-Et-β-Glc(Ac)3, -S-Et-β-Glc(Ac)4, -S-Et-α-Man(Ac)3, -S-Et-α-Man(Ac)4,
-S-Et-β-Gal(Ac)3, -S-Et-β-Gal(Ac)4, -S-Et-Lac(Ac)5, -S-Et-Lac(Ac)6, or -S-Et-Lac(Ac)7.

34. A method according to claim 27, wherein the carbohydrate moiety 
is a dendrimer moiety. 

35. A method of determining the structure-function relationships of
chemically modified mutant proteins comprising:

providing first and second chemically modified mutant proteins according
to claim 1, wherein at least one different cysteine residue in the second chemically
modified mutant enzyme is modified by reacting said cysteine residue with a glycosylated
thiosulfonate;

evaluating a functional characteristic of the first and second chemically
modified mutant proteins; and

correlating the functional characteristic of the first and second chemically
modified mutant proteins with the structures of the first and second chemically modified
mutant proteins.

36. A method according to claim 35, wherein the protein is an enzyme.

37. A method according to claim 36, wherein the enzyme is a protease.

38. A method according to claim 37, wherein the protease is a Bacillus
lentus subtilisin.

39. A method according to claim 35, wherein the residue other than
cysteine is an amino acid selected from the group consisting of asparagine, leucine, and
serine.

40. A method according to claim 35, wherein the residue other than
cysteine is in a substrate binding subsite of the protein.

41. A method according to claim 35, wherein the glycosylated
thiosulfonate comprises a thiol side chain comprising -S-β-Glc, -S-Et-β-Gal, -S-Et-β-Glc,
-S-Et-α-Glc, -S-Et-α-Man, -S-Et-Lac, -S-β-Glc(Ac)2, -S-β-Glc(Ac)3, -S-β-Glc(Ac)4,
-S-Et-α-Glc(Ac)2, -S-Et-α-Glc(Ac)3, -S-Et-α-Glc(Ac)4, -S-Et-β-Glc(Ac)2,
-S-Et-β-Glc(Ac)3, -S-Et-β-Glc(Ac)4, -S-Et-α-Man(Ac)3, -S-Et-α-Man(Ac)4,
-S-Et-β-Gal(Ac)3, -S-Et-β-Gal(Ac)4, -S-Et-Lac(Ac)5, -S-Et-Lac(Ac)6, or -S-Et-Lac(Ac)7.

42. A method according to claim 35, wherein the carbohydrate moiety
is a dendrimer moiety.